Gene Franean1_7049 details


Gene Information

Locus tag: Franean1_7049
Symbol:
ID: 5675360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8603053
End bp: 8604372
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 66%
IMG OID: 641245895
Product: extracellular solute-binding protein
Protein accession: YP_001511286
Protein GI: 158318778
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
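
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 8603053
END_BP = 8604372
GENE_LENGTH_BP = 1320
PROTEIN_LENGTH_AA = 439


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 8604372 - 8603053 + 1 gives 1320 bp, and 1320 / 3 - 1 gives 439 aa, which agrees with the listed gene and protein lengths.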


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0982583
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAGCA CTCCGAGCGA AGCCAGATCC GGGCTCACCC GTCGCGGGGT CCTGCAGATG 
AGCGGGGCAG TGGGGCTCAC CAGTATGATT GCATCTGCCT GCGGTGCCGG TGGGCAGAGT
GGCGACACCA GCGGTTCCAA CGCCCCCCTC ACCCTGTTGT GCGAGGCCGG TGGCAAGGCG
GAGCTGACGA AGATCGCCGA GTTGTTCCAT CAGGAGACCG GTCATGCGGT CTCCTTCGTG
GAGTTACCGT ACAACGGGCT CTTCAACAGA CTGAGCAGCG AACTTTCTTC GGGCACGGTC
TCGTTCGACG TCGCGGCGGT CGACGCGATC TGGTTGTCGA CCTTCGCCGG CGCCCTGCAC
CCGCTGGACG AGCTGTTCAC CGCGGACGTC AAGTCCGACC TATTCCCGGC GCTGGTCTCC
GAGGCACAGG TCGACGGCAG GTTCGTGGCC ATGCCCACCT GGACCAACGC GGAGATCCTC
TTCTACCGGA AGGACCTGTT CGAGGCTCCG GGGGAACGGA CGGCGTTCGA GAGCCAGTTC
GGGTATCCGC TCGAAGTGCC GAAGACCTGG CAGCAGTTCG AGGACACGGC CCGCTTCTTC
ACGCGGGGCA CCGAGCTCTA CGGAACCGAC GTGAAGGGGG CGGTGGAGAC CGAGTGGCTC
GCCCACGTCC TGCAGGCGGG GTCCCCCGGT GTGGTTCTGG ACCCGGACGA CAACATCATC
ATTGACAACG AGCAGCATCT GGCCGCGCTC CGCTTCTACA GCGACCTCAA CAACCGTCAT
CGGGTGGCTC CGCCGGGAGC CGCGCAGCTC GACTGGGCCG GGGCACAGAA CCTGTTCAAC
CAGGGAAAAA CGGCGATGCT GCGCTTCTGG GCCCACGCGT TCCCGCTGAT CCCCTCGGAC
TCGCCCGTCC ACGGCAAGGT GGGGGCAGCA CCCATGATCG CGGGAAGTGC CGGGATCGCG
GCCATTCCGG GGCCATGGCA CCTGTCCGTT CCCGCGGCCG GCCGCAACAC CGAGCTGGCC
ACGGAGTTCA TCCAGTTCAG CTATGAGAAC AACGCGCTGG GCATCCAGTC CTCACTCGGC
CTGGCGGCCC GCAGATCGGC CTTCGATAAG TACTCCGACA AACCCGGCTA CGAGCACTTC
ACTCCGCTGC TGGACACCCT GTCCGCCCCG GCGACGAAGG TCCGCCCGGC GACCCCCAAA
TGGCAGCAGA TCGTCGACAC CGTCCTCGTG CCCATGCTGC AGAAGTCGCT GACCGACAAC
GCCGACTACG CAGCCCTGCT GAAGGACGCC CGCGAGGATG TGCAGCGTCT TGTCAGCTAG
 
Protein sequence
MASTPSEARS GLTRRGVLQM SGAVGLTSMI ASACGAGGQS GDTSGSNAPL TLLCEAGGKA 
ELTKIAELFH QETGHAVSFV ELPYNGLFNR LSSELSSGTV SFDVAAVDAI WLSTFAGALH
PLDELFTADV KSDLFPALVS EAQVDGRFVA MPTWTNAEIL FYRKDLFEAP GERTAFESQF
GYPLEVPKTW QQFEDTARFF TRGTELYGTD VKGAVETEWL AHVLQAGSPG VVLDPDDNII
IDNEQHLAAL RFYSDLNNRH RVAPPGAAQL DWAGAQNLFN QGKTAMLRFW AHAFPLIPSD
SPVHGKVGAA PMIAGSAGIA AIPGPWHLSV PAAGRNTELA TEFIQFSYEN NALGIQSSLG
LAARRSAFDK YSDKPGYEHF TPLLDTLSAP ATKVRPATPK WQQIVDTVLV PMLQKSLTDN
ADYAALLKDA REDVQRLVS