Gene Cphy_2274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2274 
Symbol 
ID5745333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2803758 
End bp2806796 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content37% 
IMG OID641293364 
Productextracellular solute-binding protein 
Protein accessionYP_001559374 
Protein GI160880406 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAACA AAAAGGAGAA GAAGAAAATG GGAAGAAGAA TGAGGTTTAG AAGAACTCTA 
TCATTCCTTC TGGCTGCGAC CATGCTATTT ACACCAATTC AAAGTGGAAA GACTATAATT
GCTGCCGAAA ATAAAACATT GGCTGATTAC GAAGAAATTG TTGGTTCCTA TTCCGTTGAC
AAATCCATTC CGCGATATAC GATTTATAAG GAAGGCTTTA ACAATGTGAA ACCTCCGGAT
GAGTACACGA TTGAAGCAAA TGATTATGTT TATTATACAG AGGGCGAACT TGATTCAAAC
AAGAAAGAAA TAGCCGCAAT ACCAGAAGAA TTGAACGATT ATGAAGGCAT GGATGGAACC
TCGGTTATTA TTTCAGAAAC GGGATTTATT GAATATGAAG TAGATATAAA AAATGCAGGT
TTTTATGATA TTGCCTTAAC TTATTATCCA ATTGAGGGAA AGAATTCTGA AATTCAAAGG
AGTTTCTTTA TTGATGGAGA ACTTCCCTAC GATGAACTGG CACTTATTGA ATTCTCCAGA
GTATGGTCAA ACAACATTAC AGAAACCTAC CTCAATGAGG ATGGAATACC GACAAAGAAA
TGGGAAAAAG ATAATCAAGG AAATGACTTA AAGCCGACGC TTTATGAAAC ACCAGAATGG
ATTACGAGTT ATTTATATGA CAGTAATGGC TTTGTTATAA ATCAGCTTTC GGTGTATTTT
ACTGAAGGAG TACATAAAAT TTCTATGCTT TCTTTAAGAG AACCGATGTT ACTTCGTAGG
ATTACCTTAA ATAATACAAC GGAAGTAGTA GACTATAAAA CTAAAAAAGC AGAATGGGAT
GGGATGAATG CAAAGGATAC GTCAGGCATT CTTGTAAGGG TCGAAGCAGA ATCTGTTACG
AAAACGTCTT CGCAAATGTT GTATCCAAAA CAAGACCAAT CCTCCCCTGC GGTTTATCCT
TCAAGTACCA AAGAACTGCT AAACAACACG ATTGGAGGCA ATTCCTGGCG TTTGGTAGGC
CAGTGGATGG AATGGGATTT TGAGGTACCA GAGAGCGGTT ATTACAAAAT TGGGTTCTAT
GCGAAACAAA ATTTCGTAAG AGGTATCTAT GTTTCGCGAA AAATTACAAT TGACGGTAAT
GTTTTGTTCA ATGAACTAAG CGATTATGGC TTCCAATATC AATCCAATTG GAGATTTGAT
ACCTTAATGG ATGAAAATAA TGATGCCTAT AAGATTTATT TAGAAGCAGG AAATCATACC
CTTCGTATGC AAAACGTATT AGGCGAATTC TCTAGTATCA TCGGAGAGGT TCAAGAATCC
TTATCAAAAC TAAACTCTAT CTATCGAAAG ATCATCCGTA TTACGGGTGT GAAACCGGAT
ATTTATAGTG ATTATCAGAT TGAAGCAAAA TTACCAGAGC TAGAAGCTGA GCTAATCGAT
GTACATAATC AACTCGATTT CGCAATCAAA CATTTACGTG AGGTAGCAGG AGGAAGTAGT
GATAAGGAAG CTGTTTTAGT AACGATGCGT GATCAGTTAG AGGACTTAAT AAAAGATCAG
GAATATTTTA AAAAGATTGT GACTTCTTAT AAGATTAACG TTCGTGCGGT AGGAACTTGG
TTATCAGGGG CAGTTTCGCA ACCTTTACAG TTAGATGCCA TTTACATCTA TTCGCCAGAT
GTGGATGCGA ATATATCGAG AAGTTCTTTC TGGTCGAAAT TATGGTACGA GATTTGTAGA
CTTTTTTATT CCTTTGTTAT CGATTACAAC CAGATTGGTA ATGTTTCAGA AAGAGGGAAA
GAAAGCCATA CCATTACGTT ATGGGTTGGT ACTGGACGTG ACCAGGCAAA TGTAATCAAA
TCCTTGATTG ATGAGACCTT TACAAAGAGA ACAGGAATTA ATGTAAATGT AATGCTCGTT
GATATGGGAA CTTTACTACA AGCAACGCTT GCTGGCCAAG GACCCGATGT CGCGATTCAG
CTTAATATAT CAAATCCAAC TTATAATAGC GCAATCCAAA GCTCGAATGA TATGCCTATG
AATTATGGAC TTCGTAATGC GGTTGCTGAC TTAAGTCAGT TTAGTGACTT AAAGGAAGTT
AGGGAGCGCT TTTTTGATAG CGCACTGGTT CCGTTCACCT ATGACAATCA TACCTTCGCT
CTCCCAGAGA CACAAACATT CCCGATGATG TTTTATCGAA AAGATATCTT AAAAGAGTTG
GGACTTACCT TACCACAAAC TTGGGATGAT GTGAAAGTAA TCATGTCAGT ACTTGCTAAA
AATCAGATGG AATTTGGTAT GTTACCAAAT GAGTTAAATT TCTTAATGCT ACTAAATCAG
TACGGTGGCC AGTATTATAA CGAAGATGCA ACAAAATCTG CACTAGATAG TGATGAAGGA
ATTAACGCAT TTAAGGAGTA TTGTAGTCTC TATACCGAAT ATAAACTCGA TAAAGTTACC
AGTGTTGAGG ACCGATTTCG TACTGGGGAA TGTCCAATCA TTATTGCGGA TTATTCTGTT
TATAATAATT TTCAAGTTTC TGCTCCCGAC ATCAAAGGGC TGTGGGGATT TGCACCGGTC
CCTGGTATGA GGAAAGAAGA CGGGACAATA AATCGAAATG TAGCTAGTGT TGGTTCAGCC
TGCGTTATTA TGGAGTCGTC AAAATATAAA GAAGACTCCT GGGAATTTTT AAAGTGGTGG
ACTAGCGCAG AGATACAAAC ACTTTATGGA AAAGAGATGG AAAGTTTAAT GGGAGCTTCT
GCGCGTGTAG CAACTGCGAA TAAAGAAGCA TTTGAAAGCC TTCCATGGCC ATCTGCAGAT
TACAATGCTT TAAAGAAACA GTTTCAAAGT GTTGTTGGTA CAAGACAGGT ACCAGGTGGA
TACTTTACTT GGAGAAACAT CGACAATGCA TTCTATAAAG TAACCACGAA TACGGATTCT
GCATCTGCAA GAGAGTGCCT CATGGATAAT ATCATCTATA TTAATGATGA AATTAATTAC
AAGAGAAAAG AATTTCATAT GCCATTGTCG AATGACTAA
 
Protein sequence
MNNKKEKKKM GRRMRFRRTL SFLLAATMLF TPIQSGKTII AAENKTLADY EEIVGSYSVD 
KSIPRYTIYK EGFNNVKPPD EYTIEANDYV YYTEGELDSN KKEIAAIPEE LNDYEGMDGT
SVIISETGFI EYEVDIKNAG FYDIALTYYP IEGKNSEIQR SFFIDGELPY DELALIEFSR
VWSNNITETY LNEDGIPTKK WEKDNQGNDL KPTLYETPEW ITSYLYDSNG FVINQLSVYF
TEGVHKISML SLREPMLLRR ITLNNTTEVV DYKTKKAEWD GMNAKDTSGI LVRVEAESVT
KTSSQMLYPK QDQSSPAVYP SSTKELLNNT IGGNSWRLVG QWMEWDFEVP ESGYYKIGFY
AKQNFVRGIY VSRKITIDGN VLFNELSDYG FQYQSNWRFD TLMDENNDAY KIYLEAGNHT
LRMQNVLGEF SSIIGEVQES LSKLNSIYRK IIRITGVKPD IYSDYQIEAK LPELEAELID
VHNQLDFAIK HLREVAGGSS DKEAVLVTMR DQLEDLIKDQ EYFKKIVTSY KINVRAVGTW
LSGAVSQPLQ LDAIYIYSPD VDANISRSSF WSKLWYEICR LFYSFVIDYN QIGNVSERGK
ESHTITLWVG TGRDQANVIK SLIDETFTKR TGINVNVMLV DMGTLLQATL AGQGPDVAIQ
LNISNPTYNS AIQSSNDMPM NYGLRNAVAD LSQFSDLKEV RERFFDSALV PFTYDNHTFA
LPETQTFPMM FYRKDILKEL GLTLPQTWDD VKVIMSVLAK NQMEFGMLPN ELNFLMLLNQ
YGGQYYNEDA TKSALDSDEG INAFKEYCSL YTEYKLDKVT SVEDRFRTGE CPIIIADYSV
YNNFQVSAPD IKGLWGFAPV PGMRKEDGTI NRNVASVGSA CVIMESSKYK EDSWEFLKWW
TSAEIQTLYG KEMESLMGAS ARVATANKEA FESLPWPSAD YNALKKQFQS VVGTRQVPGG
YFTWRNIDNA FYKVTTNTDS ASARECLMDN IIYINDEINY KRKEFHMPLS ND