Gene Cphy_0494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0494 
Symbol 
ID5743408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp630357 
End bp631754 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content36% 
IMG OID641291606 
Productextracellular solute-binding protein 
Protein accessionYP_001557620 
Protein GI160878652 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTCA TTAAGAACAG TGTTTTGAGT ATATTATTTG TTGTTGTGAT GCTGATAGGA 
TTAACAGGAT GCGATAATTT AAAAAGAAAT TCTGCTTCTA GTCTAGAGAG TAAGGTGGAA
GAGAAGAAGA AAAAAATTGT TACTGTTTGG ACAAAAGATA GGCATGATGC AGAATTTCAG
TTGGAAAAAA TAGAGGAATA TAACGCAAGT AATTCAGATA ATATTGAAAT TAGATATTCT
ATATTTTCTG ATAATTACCT TCAAGCAGTA GACTTGGCAT TTCAAAGTAA TAGCGCTCCT
GATATTCTTG TTTTTACATC ACAGGTATTT AATAATTATG TAGTGTCAGA CCAATTTGCA
GACTTGACTC CATTTATGGA TAAAGAATTC AAGGAGACGT TCTCATCCAG TATGATTGAA
GGGTTAAATG TCATAGATGG GAAATGTTAT TACATACCTA CAGCAGCAAC GATATGCCGG
TTGTTTTATA ATAAAGATAT TTTTGAGAGA GTAGGGGTTT TGGAACCACC AAGAACTTTG
GATGAGATGG TGGAGATAGC CCAAAAAATA ACCAAAGAAT TATCCGGAGA AGGTATCTAT
GGATTTTCTG CCAATATGAA GTATCCCAAT TCTGCACTGA ATCGCTCTTT GATGCCAATG
GCGCAAATGG GCCTGGGAAT TCGAAGTGGT TTCGACTTTA AAAAAGGAGT TTATGATTTT
TCTGGGTATC AAACTATTCT TGAGGAGTGG AGAACATTAT TATCGCCGGA ATGCGCTTAT
CCCAATAGTG ACTCTTTAGA TATTGATCCT CTGAGAAAAT TATTTGCTGC AGGAAAGATT
GGTATGTATA TGTCCTATGC CCATTCAGAA TTAGGAGTAT ATGAAAACCA ATTTCCGATG
GAACAGGAGT GGAGGTGCAC CGAAATACCT GTAGTTGGCG GTATCATCCT TGGTGCACAG
AATTACTCTT TAAATAATGG TTATCTATTC AATGCCAAAA GTAAAAATCT AGATGCGGCA
TGGAAGGCTT ATGTATCTGT ATTTGCTGAT ATTGATAATT TAGCGGAATA TCATTCGCAG
GGACTAGGAA TATCAACAGT TCCAAAAGTT TTAGAAAGAG CTACCTTAAA TCAAAGTTAC
ATAGACAACC CAGCGCTATT AGTTGGTGAG AACGATAAAA TATGGCCTAA GGTTCCTCAT
GAAGCAAATG TAAATGCAAT CGTCGTGGAT GGACTTGATA TGTATAATAC CTTTGCCGAA
CTAATCTACG GAACGATGGA CATAAAAGAA GGATTATCAG ATTTGACTAA AAGGTACAAT
AAGGCATATC AAGAAGGAAT TAATTCAGGG ATTGGTGAAG TATATAAAAT AGATGATTTT
GATCCACTGA ATCCATAG
 
Protein sequence
MKFIKNSVLS ILFVVVMLIG LTGCDNLKRN SASSLESKVE EKKKKIVTVW TKDRHDAEFQ 
LEKIEEYNAS NSDNIEIRYS IFSDNYLQAV DLAFQSNSAP DILVFTSQVF NNYVVSDQFA
DLTPFMDKEF KETFSSSMIE GLNVIDGKCY YIPTAATICR LFYNKDIFER VGVLEPPRTL
DEMVEIAQKI TKELSGEGIY GFSANMKYPN SALNRSLMPM AQMGLGIRSG FDFKKGVYDF
SGYQTILEEW RTLLSPECAY PNSDSLDIDP LRKLFAAGKI GMYMSYAHSE LGVYENQFPM
EQEWRCTEIP VVGGIILGAQ NYSLNNGYLF NAKSKNLDAA WKAYVSVFAD IDNLAEYHSQ
GLGISTVPKV LERATLNQSY IDNPALLVGE NDKIWPKVPH EANVNAIVVD GLDMYNTFAE
LIYGTMDIKE GLSDLTKRYN KAYQEGINSG IGEVYKIDDF DPLNP