Gene Franean1_5458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5458 
Symbol 
ID5673789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6600220 
End bp6601470 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content74% 
IMG OID641244313 
ProductABC transporter related 
Protein accessionYP_001509719 
Protein GI158317211 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.665559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACGA TCGAGTTCCG CGACGTCACG CTCACCTACG ATGTCCGGCC GATACTCGAC 
AACCTCAGCC TCGTGATCGG GGACGGCGAG TTCCTCGTCC TCGTCGGTCC CTCCGGCAGC
GGCAAGACGA CGGCGCTGCG GATCGTCGCC GGCCTGCTCG CGCCCACCGC GGGCCAGGTC
CTCGTCGGCG GCCGGGACGT CACCCGGGTG GCGCCGGCCG ACCGTGACCT CGCGATGGTC
TTCCAGAGCT ACGCCCTGTA CCCGCACATG ACCGTGCGTC GCAACATGGA GTTCGGCCTG
AAGCTCGCCG GGGTCGACCG GCGGGAACGC GACAGCAGAG TGGCCGCGGC GGCCGAGACG
CTCGGCCTGA CCGAGCTCCT CGACCGGCGG CCGCGGGCGC TGTCCGGCGG CCAGCGCCAG
CGTGTCGCGA TGGGGCGTGC GCTCGTCCGT CAGCCGCGGG CGTTCCTGAT GGACGAGCCG
CTGTCGAACC TGGACGCCAA GCTGCGGGTG CGGGTCCGCG CGGAGATCGC TCGCATCCAG
CGCTCGCTGG GCACCACCAC GCTCTACGTG ACCCACGACC AGACCGAGGC GATGACGATG
GCCGACCGGG TCGCCGTCCT GCACGACGGG CGGCTGCAGC AGGTCGGCAC GCCCGATGAC
CTGTTCAACC GCCCGGCGAA CGTCTTCGTC GCCGCGTTCA TCGGCAGCCC GCCGGCCAAC
CTCGTCCCCG GCCGGCTGGT CTCAGAGGAT CATGCCGTCG CGCTGCGGGT CGGCGACCAG
ACCCTCGTGC TGCCAGTGGA CGTCGCCGCC GGGCTCACCC CGTCGGCCGG CACCGAGGTG
ATCGTCGGCG TGCGGCCGCA TGACGTCCAG ATCGAGCCGC CGCCGGGACC AGCCCTGATC
CTCGACGTCG ACGTCGACCT TGTGGAGCGG CTCGGCACCG AGACTCTCGC CCACGGCGAG
ATGGTGACGG GTGCCCTCAC CGGGAGCGCG GCGCGGGCGG CGATCGCGCT CGCCGCCGGC
GACGACGAGC TCACCGGCCG GGAGTCCGGC CAGGAGGCCG GTGATGAATC CGGGCGCGCG
CCCGGCGGCC GTCCGGGGAC ATCCCGTTTC ACCGCCGCGC TGAGCCCGCG GACCGACGTC
ACCGCCGCCG GGCGGCTCCG GCTGTACGCC GCGGCGGACC GGCTCCACCT GTTCGACGCC
GATACGGGCG CGACGCTGCG CCCCGCGCCC GCCGCCCTCG CCGCCGCCTG A
 
Protein sequence
MSTIEFRDVT LTYDVRPILD NLSLVIGDGE FLVLVGPSGS GKTTALRIVA GLLAPTAGQV 
LVGGRDVTRV APADRDLAMV FQSYALYPHM TVRRNMEFGL KLAGVDRRER DSRVAAAAET
LGLTELLDRR PRALSGGQRQ RVAMGRALVR QPRAFLMDEP LSNLDAKLRV RVRAEIARIQ
RSLGTTTLYV THDQTEAMTM ADRVAVLHDG RLQQVGTPDD LFNRPANVFV AAFIGSPPAN
LVPGRLVSED HAVALRVGDQ TLVLPVDVAA GLTPSAGTEV IVGVRPHDVQ IEPPPGPALI
LDVDVDLVER LGTETLAHGE MVTGALTGSA ARAAIALAAG DDELTGRESG QEAGDESGRA
PGGRPGTSRF TAALSPRTDV TAAGRLRLYA AADRLHLFDA DTGATLRPAP AALAAA