Gene Franean1_4649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4649 
Symbol 
ID5672992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5548222 
End bp5549334 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content65% 
IMG OID641243507 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001508923 
Protein GI158316415 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGCA TGATGCTTTC CCGTGACGAT GTCATCGTCG GCGTCGACAC TCACAAAGAT 
CAGCATGTTG CCGTGCTACT CGACGGTCTC GGAGGCCGTC TCGCGGACCT GGTGATCCCG
GCGACCCCGC CCGGCTTCGA GATGCTGCTC GCATTCTGCC TGGCCCGGGT GCATTCTCCT
GGGCGGCTTG TTGCCTTTGG GGTCGAGGGA ACCGGCTCCT ACGGTCTGGG ACTGGCGCGT
TTTCTGCGCC GGCATGGCCA CGACGTCCGG GAAGTCAGCC GTCCACCACG GAAAGGTGAG
CGACGTCAGG CGGGCAAGAC CGACACAATC GACGCCGAGC ATGCCGCGCG CCAGGTCGTC
GCCGGCGTGC TGACCGCGAC GCCGAAGACG GCCGATGGGT CGGTCGAGGC GCTTCGTCTG
ATCAAGGTTG CCCGGGACAC TGCGGTGAAG GCCCAGTCAG CAGCCATGAT CACGTTGAAG
GCGACGTTGG TGACCGCCGA CGACGAGCTC CGGGGCGCGC TCGAACCTCT CACAGATCAC
CGCCTGATCG AAGCCTGCGC GGCGCTCGAG TGCGTGGGGG CGCCGACGAC ACCCGCGAAG
GCGATGAGGC ACGTGCTGGC GTCGCTGGCG CGGCGGTGGC TGAGTCTGCA CGAAGAAGTC
AAGTCACTGA GCTGGCATCT CAAACATCAG ACAAAAACCG CCGCGCCACG GCTTGTCGAA
GCAGTCGGTA TCGGCCCTGA CACGGCGGCT GAAATGCTGA TCGCCGCCGG GGACAACACC
GACCGGATCC GCTCGGAATC GGCGTTCGCG AAGCTCTGCG GCGTGAGTCC GATCCCGGCG
TCCTCCGGGA AGACGCACCG TCACAGGCTC AACCGAGGTG GGAACCGGCA GGCCAACGCC
GCGCTCTACC GTACCGTCAT CGTGCGCATG CGATGGCATC AACCTACCAT CGATTACGTC
GAGCGGCGCA CCGCAGAAGG ACTTACGAAG CGTGAGATCA TCCGCTGCCT GAAACGATAC
GTCGCGCGGG AACTCTATCG CCTTCTACCG CCATCGAACG TGGTCGAGTA CAGCCGCGCC
GCGGCTTCTG ATCCGTCGCC TCAGGCCGCT TGA
 
Protein sequence
MPSMMLSRDD VIVGVDTHKD QHVAVLLDGL GGRLADLVIP ATPPGFEMLL AFCLARVHSP 
GRLVAFGVEG TGSYGLGLAR FLRRHGHDVR EVSRPPRKGE RRQAGKTDTI DAEHAARQVV
AGVLTATPKT ADGSVEALRL IKVARDTAVK AQSAAMITLK ATLVTADDEL RGALEPLTDH
RLIEACAALE CVGAPTTPAK AMRHVLASLA RRWLSLHEEV KSLSWHLKHQ TKTAAPRLVE
AVGIGPDTAA EMLIAAGDNT DRIRSESAFA KLCGVSPIPA SSGKTHRHRL NRGGNRQANA
ALYRTVIVRM RWHQPTIDYV ERRTAEGLTK REIIRCLKRY VARELYRLLP PSNVVEYSRA
AASDPSPQAA