Gene Franean1_3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3100 
Symbol 
ID5671479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3658590 
End bp3659618 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content70% 
IMG OID641241998 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001507418 
Protein GI158314910 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.185223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCCAT CGTATGAGGG TCGCCAGATT GTGGGAATCG ATCTCCATCG GCGGCGGTCG 
GTGATCGTGC GGATGACCCC GGACGGGTAT CCGCTGGGAA CGGTTCGGAT CAGCAACGAT
GCTGCCACCC TCGCTGCTGA GATCAGCCGT GCCGGTGAGC ACCCCGACGT GGTGTTGGAG
GCGACGTACG GCTGGTACTG GGCGGCGGAT GTGCTGGCCG CCGCTGGCGC GCAGGTTCAT
CTCGCGCACC CGCTGGGCGT GAAGGGTTTC GCCTACCGGC GGGTGAAGAA CGACGTCCGT
GACGCCGCGG ATCTCGCGGA CCTGCTGCGT ATGGGACGTC TGCCCGAGGC GTGGGTCGCT
CCACCGCCGG TGCGGGAACT GCGGGAGACC GTCCGTCACC GGGCGGCGCT CGTCGCGATC
CGGTCGGCGT GCAAGGCCCA GATCCACGCG GTCCTCGCGA AGAACGGTGT CGCGGTGCCC
ATGACCGACC TGTTCGGCCA GGCCGGAACC GACCTCCTCG GCCAGGTCGA GCTCCCGAGC
CCGTTCCACG CCCGCGTCAC GAGTCTGCGC CGCCTCATCG ACCTGCTCGA CTTCGAAATC
GACGCGGCCG CCTGTCAGCT CGCCGGCCGG CTCGCGCGGG ATCCGGGCTA TCAGGCGCTG
CTGGTCCTGC CCGGGGTCGG GAAGACCTTG GCCGCGGTGT TCCTCGCCGA GATCGGGGAC
ATCACCCGCT TCCCCACGCC CGGTCACCTT GCCAGTTGGG CTGGTCTCAC TCCCCGGCAC
CGTGAGTCCG ACACCACCGT GCACCGCGGC CACATCACCA AACAGGGCTC CTCCCTGATT
CGCTGGGCCG CGATCGAAGC CGTGTCGATC CTGCCTCCGA CGACCCCGGT CCTGGGCCCG
ACCAAGACCC GGGTCGCTGC CCGCCGCGGC ACCAACATCG GCAAGGTCGC CGCGGCCCGC
AAGCTGCTCA CGTTCGTCTT CTACGCGCTG CGCGACGGTG AGGTCCGCGC GCTGCACACG
GCGGCGTGA
 
Protein sequence
MSPSYEGRQI VGIDLHRRRS VIVRMTPDGY PLGTVRISND AATLAAEISR AGEHPDVVLE 
ATYGWYWAAD VLAAAGAQVH LAHPLGVKGF AYRRVKNDVR DAADLADLLR MGRLPEAWVA
PPPVRELRET VRHRAALVAI RSACKAQIHA VLAKNGVAVP MTDLFGQAGT DLLGQVELPS
PFHARVTSLR RLIDLLDFEI DAAACQLAGR LARDPGYQAL LVLPGVGKTL AAVFLAEIGD
ITRFPTPGHL ASWAGLTPRH RESDTTVHRG HITKQGSSLI RWAAIEAVSI LPPTTPVLGP
TKTRVAARRG TNIGKVAAAR KLLTFVFYAL RDGEVRALHT AA