Gene Cphy_2654 details


Gene Information

Locus tag: Cphy_2654
Symbol:
ID: 5742821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 3239628
End bp: 3241388
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 37%
IMG OID: 641293746
Product: sugar ABC transporter substrate-binding protein
Protein accession: YP_001559754
Protein GI: 160880786
COG category:
COG ID:
TIGRFAM ID:
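
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Cphy_2654.
length = gene_length_bp(3239628, 3241388)
print(length)                     # 1761
print(protein_length_aa(length))  # 586
```

Both results agree with the Gene Length (1761 bp) and Protein Length (586 aa) fields of this record.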


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAA AGGTTCTTAG TTTAGTTTTA TGTGTAGCCA TGGTAGCATC TTTGGCTGCA 
TGCGGTAACA AGGGTAAAGA CGCTTCAAAC AACAATAATC CAACTGCACC GGCTGCAACA
AAAGCACCAG CTGCAACACA AGTACCGGAA GTAAAATTAG GGTATGAATT TGGTTTAGAA
ACAACATTTC ATTCAGATGA ACCAGTTACA TATTCCATGT TATTTAGTGA TGCCAGTTGG
TATCCAATGG TAGACAGATG GGAAACAGAG GGAGTATTTG CTAAGATTAA AGAATTAACT
AATGTTACTT TAGACGTTAC AAAAATTGAT AGCGGTGATT ATGATCAAAA GAAATCTCTT
TTGATTAATG CAGGTCAGTC TGCATATATC ATTCCTAAGA CTTATGATGA ATCCGCATTT
GTTGACGGTG GCGCAGTAGT TGCAGTATCC GATTGGGTAC AGTATATGCC TAACTATACG
GAGTTCTATG ATAAGTATAA TATGGAAGCA GACGTTAAGA CAATCGTTCG TGCAGATAAT
AAGTATTATC GTTTACCTGG TATGAAAGAA AGTTCTTTAC AGGATTATAC TCTCTTAATT
CGTAACGATA TCTTTAAGGC AGCTGGTTAT GATGTTGCTG CTCTTGAAAA AGATTGGACA
TGGGATGATT TATATGATGT ATTAGTTGGT GTTAAGGCTT ATATGGTAAG CAAGGGTATG
TGCAAAGAAA GTGATTATAT CTGGTCAGAC CTTTGGTGTG GAAATGAGTC AGGTCAAGGA
AATGGTGGTA ACCTTCTTAA ATTAATGGGT GCTTCCTATG ATGTTCCTTC TGGCTGGGCA
GTAGCAGATG GTATGCAATA TGATGCTGCA ACAGATAAGT GGTATTTCGC ATCAACATCA
GATAACTACA AGCAATTTGT AACTGTTGCC AATAAATATG TTGCTGGTGG TATATTAGAT
CCTGAAACAT TTACACAGGA TGATGTTACT GCAAATAATA AATTCTATCG TGGTGAGACC
GCTATTATCA GTGTTAACCG TTCTCAGTAC ACAGCATGGC ATCAAGGTTT GGATGAAGGT
ATTGGCAAAG GAAATTACGA GACATATTTA ACTGTATATC CTAGAGGAAA CAATAAATAT
ACTTCTGAAA ACACTCGTCT TGAGAATGGT GTTATGATCG CAAGCAAGGC TTTGAAAGAA
CTTGGTGAAG ATGATTTTAT TAAGATGCTT CGTTTTGTTG ACTGGTTATG GTACTCTGAT
GCTGGTAAAA CACTTGCAAA ATGGGGTGTT GAGGGTGAAC ATTGGAACTA TGTAACAGAT
GCAACAACTG GACTTAAAGT AAAAGCTTTG ACTGAGAATT GGTTCTGTGG AGGACTTGGT
ATACCTGCAA CAGACGATAC AAAGCAGAAG GATTTAAGAC TTGAGCTTGG ATATGCAGGT
GGTAACTTCT GGTATGGTGG TAGCAATGCA CAGTTAACTG ATAACTTTAC ACCTGTTATG
CAGGATTATT ATGCAAGAGT TGCTGCAGAT AGAGCAATTA AACCATTAAA TCCTTCCTTC
GCTAGAACGG AAGATGAAAA TGATCAGATT AATCTTTGGA AGACTCCATT GATTGATAAT
GTTAACTCAT GTACACTTAA GTTCGTGACT GGCCAAATGA ACATCACAAA TGATTGGGAT
GCTTATGTAG CAAGTTGTAA GAATTTAAAT AGTGAAAAAT TAGTTGACCT ATACAATGAT
ATGTATAACC GCTCAAAATA A
 
Protein sequence
MKKKVLSLVL CVAMVASLAA CGNKGKDASN NNNPTAPAAT KAPAATQVPE VKLGYEFGLE 
TTFHSDEPVT YSMLFSDASW YPMVDRWETE GVFAKIKELT NVTLDVTKID SGDYDQKKSL
LINAGQSAYI IPKTYDESAF VDGGAVVAVS DWVQYMPNYT EFYDKYNMEA DVKTIVRADN
KYYRLPGMKE SSLQDYTLLI RNDIFKAAGY DVAALEKDWT WDDLYDVLVG VKAYMVSKGM
CKESDYIWSD LWCGNESGQG NGGNLLKLMG ASYDVPSGWA VADGMQYDAA TDKWYFASTS
DNYKQFVTVA NKYVAGGILD PETFTQDDVT ANNKFYRGET AIISVNRSQY TAWHQGLDEG
IGKGNYETYL TVYPRGNNKY TSENTRLENG VMIASKALKE LGEDDFIKML RFVDWLWYSD
AGKTLAKWGV EGEHWNYVTD ATTGLKVKAL TENWFCGGLG IPATDDTKQK DLRLELGYAG
GNFWYGGSNA QLTDNFTPVM QDYYARVAAD RAIKPLNPSF ARTEDENDQI NLWKTPLIDN
VNSCTLKFVT GQMNITNDWD AYVASCKNLN SEKLVDLYND MYNRSK