Gene Cphy_3027 details

Gene Information

Locus tag: Cphy_3027
Symbol:
ID: 5743353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 3697330
End bp: 3698733
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 37%
IMG OID: 641294128
Product: extracellular solute-binding protein
Protein accession: YP_001560123
Protein GI: 160881155
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
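
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (the listed values fit that convention) and that the stop codon is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3697330, 3698733)
print(length, protein_length_aa(length))  # 1404 467
```

Both results match the Gene Length and Protein Length fields in the record.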


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTAA AAAGCGCATT AAAACGTGGC TTTGCTTTCA GTTTAGCTAC AGTAATGGTT 
TTAAGTACAG CTGGATGTGG AAAATCCGAT AATACTAAGG ACAACTCGAA CGGTAATACA
CCAACAGGTA CTGAAGGTAC CGAATCTTCT CAAAAGCCAA ACAGTGATAA GCCATATGAT
GGTGTTACTG TTAAATGGGC GTTAACAGAT AACGCTGCAA CCGGTTCTGA AACAAAAGAG
ATGGTTGACT TAATCAAAGA AAAAACAGGT ATTAACGTAG AGTTTTATAT CACTCCTACA
TCAAAAGCAG GAGAAATGGA CAAGGTACTT GTAAGCTTAA TGGCAGGAGA AGCAATCGAC
ATCATCGGTA GAACTCCACT TCAGTTAGAA GAATTCTACA AAGCTGCTGT ATTAGAGCCA
ATTGATGACC TTGCAAAAGC AGATAACTAC GATATGTCAG CTATTTACGG TGACAAAATT
GTAAAATTTG AAGATAAATC TTTCGCAATG CCTGCAGAAA AGGACATTTG GTTAACTTAT
TATAATAAGA AAATCTTTGA TGACGCTAAT ATTCCATATC CAACAGCAGA AGGCTGGACA
TGGGAAAAAT ATGTTGAGAC AGCTCAGAAA CTTAACAATC CAGAGAAAAA TATCTGGGGT
TCCTTTATGA GTGATGACGT TGCTTGTAAC TATATGTTAG CTACACAAAA GGGTGTTTCT
GCCTATAAAG CAGACGGAAC AGCAAACTTT GATGATCCTG CATTCGCTGA TGCTGCAAAA
TGGTTCTTTA GTTTAGGAAA TGATCTTAAG ATTCAACCAG GTTGCATCGA TTTAGCTTCC
GGAACATATC CATATAACTC TTTCATGGTA AATGGAAATA TCGGTATGTA TGTATATGGT
GGATGGGTAG CAAGTGCATT ATCTGATAAG ACAAAATATC CAAGAGATTG GGAATTAGGA
ATCCTTCCTA TGCCATATCC AGAAGGTGAA GATCCATCTT CTTTAACAAT TACAAGTTGC
TATGCTATTC CAAAGACATC TAAGAATAAA GAAGCAGCAT TTGAAGCAAT TAAAACAATT
TGTGAAAATA AATATACTTT AGGTTATGGA CGTGTTCCAG CAAAGATTCT TACAGAAGAT
GAGGCAAAAA CATATATTGA GTCCAGCTTA CTTCCAAAAT TTAAAGATGA CAACTTAACA
GTAGATGATT TCATGAAAGG TTGGTTTGAT AACAGCAGAT TATACTTAAG TGAAAAGATT
ATGGGTACTG CTGATACAAC AATCGGTCAG ATTTACACTG AGGAAGGCCA GCTATACGGA
CAGGGACAAA AGTCACTGGA AGATACCATG AAATCTATTC AGGACAGAGC AAATGAAGCG
ATTAAAGAGG CTAATGAGCA ATAA
 
Protein sequence
MRLKSALKRG FAFSLATVMV LSTAGCGKSD NTKDNSNGNT PTGTEGTESS QKPNSDKPYD 
GVTVKWALTD NAATGSETKE MVDLIKEKTG INVEFYITPT SKAGEMDKVL VSLMAGEAID
IIGRTPLQLE EFYKAAVLEP IDDLAKADNY DMSAIYGDKI VKFEDKSFAM PAEKDIWLTY
YNKKIFDDAN IPYPTAEGWT WEKYVETAQK LNNPEKNIWG SFMSDDVACN YMLATQKGVS
AYKADGTANF DDPAFADAAK WFFSLGNDLK IQPGCIDLAS GTYPYNSFMV NGNIGMYVYG
GWVASALSDK TKYPRDWELG ILPMPYPEGE DPSSLTITSC YAIPKTSKNK EAAFEAIKTI
CENKYTLGYG RVPAKILTED EAKTYIESSL LPKFKDDNLT VDDFMKGWFD NSRLYLSEKI
MGTADTTIGQ IYTEEGQLYG QGQKSLEDTM KSIQDRANEA IKEANEQ
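
The sequences above are printed in blocks of ten characters, so they must be joined before use. The following is a minimal sketch of doing that and translating the start of the gene sequence. It assumes the codon-to-residue assignments of translation table 11 are the same as those of the standard code (the two differ in their permitted start codons), and it does not handle alternative start codons or ambiguous bases. The helper names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that split the sequence into blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed in the record.
first_line = "ATGAGATTAA AAAGCGCATT AAAACGTGGC TTTGCTTTCA GTTTAGCTAC AGTAATGGTT"
dna = join_blocks(first_line)
print(len(dna))             # 60
print(translate(dna[:30]))  # MRLKSALKRG
```

The ten residues produced from the first 30 bases agree with the first block of the protein sequence.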