Gene Cphy_0291 details


Gene Information

Locus tag: Cphy_0291
Symbol:
ID: 5744214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 361234
End bp: 362583
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 38%
IMG OID: 641291381
Product: extracellular solute-binding protein
Protein accession: YP_001557417
Protein GI: 160878449
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
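
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 361234
end_bp = 362583

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1350 449
```

Under these assumptions the result agrees with the listed gene length (1350 bp) and protein length (449 aa).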


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA AAAAATTAGT AAAAGTACTT AGTTCTCTTC TCGCAGCTTC AGTTATTTTC 
ACAGGCTGCA GTAGCACAAA AAATAATCAG CCTTCTTCTG CTGATCCAAC CCCAACAAAC
AATTCTAGCG AAAGCAAACC TACCACAGAG CCAGAAAAGA ACGAGCTTAG CGGCAGTATT
ACATTTTCCG CTTGGGACGT GGATACCCAA ATGCCTTATA TCAAGGATAT GTTAGCTGAA
TTTATGAGTC AACATCCTGG AGTTAATGTG GAAATTGTTG ATATTCCTTC CGCTGACTAT
GATACAAAAT TAAACATCGA TTTAAACGGT GGAGCTGCTG CTGACGTAAT TTTAGTAAAA
AATGCGTCTA CAATGCCATC TATGAATCAA AAGGGTCAAC TTGTCGATTT AAATGAATAC
ATAAAACGCG ACAATGTAGA CTTAAGTTCC TATAACGGAT TAGATAAAAG TATTAGCATT
AACGGTACTC AGCCAGGTCT TCCTTTCCGT ACAGACTATT ATGTATTGTA CTATAACAAA
GACATCTTTG ATACAGCCGG AGTTGCTTAC CCATCAAACG ATATGACTTG GGCAGATTTC
GAAGAACTTG CTAAAAAAGT AACTTTCGGT GAAGGTGCAA ACAAGGTTTA TGGTGCACAC
CTTCATACAT GGCAAGCATT AGTAGAAAAC TGGGCTATAC AGGATGGAAA GAACACAACT
ATGGGTCCTG ATTACGAATT TATGAAACCT TATTATGAGA TGGCTTTAAG AATGCAAAAC
GATGACAAGA CCATTATGGA TTATGCAACA CTTAAAACAG CAAACATCCA TTATTCAGGT
GTATTCCAAA ATGGTTCTGT TGCTATGCTT CCTATGGGCA CATGGTTCAT GGCTACAATG
AGAGATGTTG TAAGCAAAGG TGAATGCAGT GTGAATTGGG GAGTTGCTAC AATACCTCAT
CCAGAGGGCT TAGAGGCAGG AAACACTGTA GGTTCCGCAA CTCCAATTTC AATTAACGCA
GCTTCTAAAA ATAAAGAATT AGCTTGGGAG CTTATCAAAT TTATGACAGG TGATAGCGGT
GCTTCCTATC TAGCTTCCGT TGGTCAATTA CCAGCTCGTA TTAATCCAGA ACTTCTTGAT
ACTGTTACAT CTTTAGAAGG TATGCCTGAA GGTGCAAAAG AAGCTTTACA AGTTAAGAAT
ATCGTATTAG ATCGTCCAAT CGTAGATAAT GTAAATGAAG TTGATAAGAT GCTTGGTGAG
GAACACAGCC TTATCATGCT CGGTGAAGTT ACTATAGACG AAGGAATTAA AGCTTTCACT
GAAAACTCTA AGACAATACA GGAACAATAA
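
The GC content field in the record could in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base spacing shown; `gc_percent` is a hypothetical helper name, and the short input used here is a toy string, not the gene itself.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Toy input to show the call; paste the full gene sequence to check the record.
print(gc_percent("ATGC ATGC"))  # prints: 50.0
```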
 
Protein sequence
MKNKKLVKVL SSLLAASVIF TGCSSTKNNQ PSSADPTPTN NSSESKPTTE PEKNELSGSI 
TFSAWDVDTQ MPYIKDMLAE FMSQHPGVNV EIVDIPSADY DTKLNIDLNG GAAADVILVK
NASTMPSMNQ KGQLVDLNEY IKRDNVDLSS YNGLDKSISI NGTQPGLPFR TDYYVLYYNK
DIFDTAGVAY PSNDMTWADF EELAKKVTFG EGANKVYGAH LHTWQALVEN WAIQDGKNTT
MGPDYEFMKP YYEMALRMQN DDKTIMDYAT LKTANIHYSG VFQNGSVAML PMGTWFMATM
RDVVSKGECS VNWGVATIPH PEGLEAGNTV GSATPISINA ASKNKELAWE LIKFMTGDSG
ASYLASVGQL PARINPELLD TVTSLEGMPE GAKEALQVKN IVLDRPIVDN VNEVDKMLGE
EHSLIMLGEV TIDEGIKAFT ENSKTIQEQ
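
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons one at a time. The following is a minimal sketch, assuming the amino acid assignments of NCBI translation table 11 (which match the standard code; the tables differ in permitted start codons) and a sequence that is already in frame. `translate` and `CODON_TABLE` are illustrative names, not part of the record.

```python
from itertools import product

# Codon order follows the NCBI genetic code layout: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAAAATA AAAAATTAGT AAAAGTACTT"))  # prints: MKNKKLVKVL
```

The output matches the first ten residues of the protein sequence shown above.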