Gene Cphy_0487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0487 
Symbol 
ID5745247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp615427 
End bp617103 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content36% 
IMG OID641291599 
Productextracellular solute-binding protein 
Protein accessionYP_001557613 
Protein GI160878645 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATTAA AAAAAGTAAT GTGTGTAACA TTAGCAGCGG CAATGTTAAT GGGTAGTCTT 
GCCGGCTGTG CAAAAAAGGA TAATAAAGTG GACAATCCAT CCGTAACACA GCCTGCAACA
GATGAGAATA ATAAAGGCAG TGATGATAAC AAAGAAAATA CTGATGCTCC AGTGGCTAAT
GCAGAGCCAG TAGAATTAAA GATTCACTTT CACTCCAATA ATAAATATAC CTTATTAGAT
GAGAACGGAA AGTTATTACC AGTATTTGCT TTGGCTGGGG AGAAGACTAA TTCTATCGTG
GAAAGTACTG CAAATCCTGT AGCACAGAAT TCAATCGAAG AATTTCAGTT ACAGGCAGCT
GAGAAATTTC CAGCAGATAT CTACGGTGGA ACCTCCTTAC GATCTTCTAT TACTTCTTAT
GCATACCAGG GAGCTTTTCT ACCTTTAAAT GACTTAATTG ATCAGTATGC ACCAAACCTC
AAGAAGTTCC TTGATGAGAA TCCAGATGTA AAAGCTGCTC ATACTGCTGC AGACGGCAAT
ATCTATATGT TAAATTATTG TCCGGATGGT GATGTTGCAA GAGTTTATTT TATTCGTACT
GATTGGTTAG ACAAATTAGG CCTTAGTATG CCAACAACAT TCGAAGAGTT GGAAAATGTA
TTATATGCAT TTAAAAACAA CGATCCTAAC GGAAATGGTT TGAAAGATGA AGTTCCGGTA
TTTAATGATA AATGGGAAGA ATTAATACGA TTAGCTAATC TTTGGGGTGC AAGAGTATAT
GGCTTTGATA CTTTTACAGA GCGTGTTGTA TTAGATTCTA ATGATAAATT CTATCAAGCA
TGGACAGCAC CTGAGTTTAA AGAGGCATTA ATTGGATTAA ATAAGTGGTA CAAAGATGGA
ATTATTGATC AGGAAGCATT TACAAGAAAA CAGAATACCG CACGTCAGAC ACTTTGGGCA
AAAGAAAATA CTGGCGGTAT GACACATGAA TTCTTTGCAT CAACATCAGC ATTTAACTAT
AATACAGAAG TATTGGCAGC AGTACCTGAT TTTAAACTGG AAGCTTTCCT TCCTGTCAAT
AAGAATGGTG CAGGATTTGA AGAGCATCAA AGAGCGATAG CAAAACAAGA TGGATGGGCA
ATTTCTGCAA GTACAAAGAA TGCAGAAGCA GCAATTCGTT ATATGGATTG GTTCTACTCA
GAAGAAGGTA GAAGAGCGAT TAACTTCGGT ATTGAAGGTG AATCTTATAC GATGGTAAAC
GGTGTTCCTA CATTTACAGA AGATGTTTTA AAGCAAGGTA ACGTAAATAC TTACTTACAA
AGTGCTTATG GTGCTCAGCT TCCAATTGGT TATAAGCAAA ATTATGATTA TGAAGATCAA
TGGGTAACCA AAGAGGGTAG AGATGCTAAT GAGCTTTATT CTGCCAATAA AGCAAGTGTT
TATACTGTAC CAACTACTCC AGTTCTAAGC TTTACAGAAG AAGAAACAGC AATGTATGAT
CAATGTCTTA CTGCTATAAA TGAATACCAA AATGAGATGG TAACTGCATT TATTACTGGA
AAGACTGATA TCGAATCTAA CTGGGATGCA TATATAGCAA AATGTAAGGA ATTAGGAGTA
GATACTTTAG TAGAACTATA TGAAACAGCT TATGCAAGAT ATAAAGCAAT CAAATAG
 
Protein sequence
MRLKKVMCVT LAAAMLMGSL AGCAKKDNKV DNPSVTQPAT DENNKGSDDN KENTDAPVAN 
AEPVELKIHF HSNNKYTLLD ENGKLLPVFA LAGEKTNSIV ESTANPVAQN SIEEFQLQAA
EKFPADIYGG TSLRSSITSY AYQGAFLPLN DLIDQYAPNL KKFLDENPDV KAAHTAADGN
IYMLNYCPDG DVARVYFIRT DWLDKLGLSM PTTFEELENV LYAFKNNDPN GNGLKDEVPV
FNDKWEELIR LANLWGARVY GFDTFTERVV LDSNDKFYQA WTAPEFKEAL IGLNKWYKDG
IIDQEAFTRK QNTARQTLWA KENTGGMTHE FFASTSAFNY NTEVLAAVPD FKLEAFLPVN
KNGAGFEEHQ RAIAKQDGWA ISASTKNAEA AIRYMDWFYS EEGRRAINFG IEGESYTMVN
GVPTFTEDVL KQGNVNTYLQ SAYGAQLPIG YKQNYDYEDQ WVTKEGRDAN ELYSANKASV
YTVPTTPVLS FTEEETAMYD QCLTAINEYQ NEMVTAFITG KTDIESNWDA YIAKCKELGV
DTLVELYETA YARYKAIK