Gene Cphy_1723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1723 
Symbol 
ID5741474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2122313 
End bp2123833 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content34% 
IMG OID641292823 
Productextracellular solute-binding protein 
Protein accessionYP_001558834 
Protein GI160879866 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATA GCTTAAGGGT TATAATCTCA ACACTAGGTA TTATTTCAAT AATTGTTATG 
ATGATAGGAT GTAGCCTCAG GAGTGAGGAT GAAGTCTATA CTCAGAATGA AGTCTCTAAA
CAAAATAAAG TTGTGGAACT AATATGGTAT CAACTAGGTG CACCTCAAGA GGATGGTGAT
TTAGTTCTTA AAACAGTCAA TGATTATATA AAGGATAAAA TTGGGGTCAT AATAAAAATT
AAGTATATTG GATGGGTGGA TTATAATCAT AAAACACAGC TAGTTATCAA TGCCGGAGAG
CCTTTTGATT TAATGTTTAC GAGTTCATGG GCGAATGATT TTTCACAGAA TGCACAAAAA
GGTGCTTTTT TGTCGTTGGA TGATTTACTA CCTGTATATG GAAAGGAGAT GCTTCAGAAT
ATTGACTTAA GGTTTTGGGA AGCCGCAGAG GTACATGGAA AGATTTATGC AGTACCGAGT
GAAAAAGAGA TTGTAAATAT GCCAATGTGG ACATTTACCA AAGAGTATGT TGACAAATAT
CAAATTCCGT ATGAAAAGCT TCATACGTTA GAAGATTTAG AACCATGGTT AAAATTAATA
AAAGAGAATG AACCAGATGT TATTCCGCTA TATTTAACCA AAGATTTCTG CCCTCCGATT
TATATGGATG AAATTATATA TCCATTAGGC ATTGAGTATT CCTATGATAC GAATAATCTA
ATTGTTAGTA ATTTGTTTGA AACAGAAACC ATGAAATCAA CCTTAAGAAC ACTTCGAGAG
TATTATTTAA AAAAATATAT TAATTGGAAT GCAGCCACAA TCTCAGATGA TAAAACCGTG
AAGCGCTTTG TAACGAAGGG TGATGGGCAG CCATATGCGG ATCGAATTTG GTCAAGAAAC
CTTGGTTATG AAGTAGTGAC AAGTACGATC ATGGAAACAA AAATTTCTAA TTATTCGGCA
CGTGGTGCCA TGACAGCGAT CTCAAGAACA TCAAAGCACC CAGAAAAAGC AATCCAATTT
TTAAATCTAT TAAACACCGA TGAGTATTTG CGTAATTTGA TTAATTATGG AATAGAAGGT
GTACATTACG ATAAAGTAAA TGCTAGTGAT GATGAATTAA GAAGTGTCGA AGGTTCGGAT
CATGTTTATC CATTTAAAAT TAGATATAAT AATGACAATA TGAAACGTTA TGATGTTCCT
TATTGGGTGC AAGGTGGTCT ATTTAATACT TATGTTCCTG AGGGAGAACC TCTTGATAAA
TGGCAGAAGT TCAGAGAATT AAATAAGAAT GCAGAGACAG CACCAACGTT CGGCTTTGAT
TTTGATTCAG ATGCTGTGAG CTCTCAAATA GAGAAAATTA AAAGCGTAAT GAATGAATTT
GTTCCGCCAC TTTATACCGG TAGTGTAGAA CCGGATGATA TTTTACCTAA ATTGCAACAA
AAACTAAAAG AAAATGGAAT AGATAAAATA CAAGAAGAGA TTCAAATTCA ATTAGATGAA
TGGAAGATGA ACGGACAATA G
 
Protein sequence
MKDSLRVIIS TLGIISIIVM MIGCSLRSED EVYTQNEVSK QNKVVELIWY QLGAPQEDGD 
LVLKTVNDYI KDKIGVIIKI KYIGWVDYNH KTQLVINAGE PFDLMFTSSW ANDFSQNAQK
GAFLSLDDLL PVYGKEMLQN IDLRFWEAAE VHGKIYAVPS EKEIVNMPMW TFTKEYVDKY
QIPYEKLHTL EDLEPWLKLI KENEPDVIPL YLTKDFCPPI YMDEIIYPLG IEYSYDTNNL
IVSNLFETET MKSTLRTLRE YYLKKYINWN AATISDDKTV KRFVTKGDGQ PYADRIWSRN
LGYEVVTSTI METKISNYSA RGAMTAISRT SKHPEKAIQF LNLLNTDEYL RNLINYGIEG
VHYDKVNASD DELRSVEGSD HVYPFKIRYN NDNMKRYDVP YWVQGGLFNT YVPEGEPLDK
WQKFRELNKN AETAPTFGFD FDSDAVSSQI EKIKSVMNEF VPPLYTGSVE PDDILPKLQQ
KLKENGIDKI QEEIQIQLDE WKMNGQ