Gene Cphy_1529 details

Gene Information

Locus tag: Cphy_1529
Symbol:
ID: 5744351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 1878477
End bp: 1879796
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 38%
IMG OID: 641292631
Product: extracellular solute-binding protein
Protein accession: YP_001558642
Protein GI: 160879674
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
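
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1878477, 1879796)
print(length)                     # 1320
print(protein_length_aa(length))  # 439
```

Both values match the Gene Length and Protein Length fields of this record.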


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAAGA GGAAAATATC AGCACTAGTT CTTGTCGCAG CAATGACTTT TAGTTTACTA 
GCAGGATGTG GGAATAAATC TTCGGATACT TCCACAAACA AAACGGATGG GGTAGTTGAA
ATTAAGTGGA TGTTTTGGGA TGACTTATCG GCCACTCAGG ACTTAATTAC AAAAGGATAC
GCACAAGTAA TTGAACGTTT TAATGAACAA TATGCTGGAA GATATCACTG TACACCGATT
ACTACAAATC TAGAGGAATA TGATACGAAA TTAAATGCAT TAGTTGCTGC CAATAATTGT
CCGGATGTAT TTATCTGTAA TCCAGGACCT AATTTAACGC AGTATGTGGA AAGTGGGACA
GCAGCGGATC TAACGGATAT TTTAAAGAAA AACGAATCAA AGTGGTATGA ATCCTTTACA
GAAGGTATTT TCGAGAGAAT GACATACGAT GGAAAAATTT ATGCAGTTCC AACAAACTTT
GCAGCAGCTT TAGTTTTTTA TAATACAGAG ATATTTGGAA AAACAGGGGC TTCTGTTCCA
ACAACATTTA CGGAGTGGAT TGCTACCTGC AAAAAGATTC AAGATGCTGG ATACACACCA
ATTTCTTGCT CAGCTGGTAC CGCTTGGTGC TTATCGATGA TTGCAGGCTA TCTTTGTGAT
AGAGCTGGTG GACCAGATAA TTTAATTGGA GTAAATACGG GCACTTTAGA TTGGACGAGT
GAATCTTTTC TAAATGCTGG TGAAAAGTTA GTTGAATTAT CAAAGTATTT TCAAAAAACA
GCAGCAGGAG ATTCGAATGA TCAAGCAACT GCAGGTTTTT ATAACGGCGA AGCTGCCATG
TTAGTACAGG GATCTTGGGC AATTGGCCAG ATTAATGGTA ACAATCCTGA GATGGAGCCA
AAGTGTGGAG TATTTTCTTT CCCAGCAATC GAAGGTGGGG CTGATCCTAA TCGTATGATT
GTTAAAACGG ATAATCTGGT TATGAGTTCA AAAACGAAAA ATCAGGACGC TTGTATAGCT
CTTATGAAAT GTTTTACAGA CGAAACAGCA CAGAAGTATA CTGCCGAAGT TGGCGGAAAG
ATTCCAATTA TTAAGGTTGA TTTCGATAAA GAAAAAGCAC CAGCACAGCT TAGTTATGTT
ATGGATATCT TAACAAAATC GACTGGTACC CTTGGTTTTT ATAATGAATC TTTAGCATCT
GTAGAGGCAG GGGATACATT CGATAATTCT ATGGTTGACC TATTCCTCGG AAGTATTTCG
GTAAATGAAG CATTCCAAAA TGTTCAGGAT TTCTATAAAG AGCATGTATG GAAAAAGTAA
 
Protein sequence
MLKRKISALV LVAAMTFSLL AGCGNKSSDT STNKTDGVVE IKWMFWDDLS ATQDLITKGY 
AQVIERFNEQ YAGRYHCTPI TTNLEEYDTK LNALVAANNC PDVFICNPGP NLTQYVESGT
AADLTDILKK NESKWYESFT EGIFERMTYD GKIYAVPTNF AAALVFYNTE IFGKTGASVP
TTFTEWIATC KKIQDAGYTP ISCSAGTAWC LSMIAGYLCD RAGGPDNLIG VNTGTLDWTS
ESFLNAGEKL VELSKYFQKT AAGDSNDQAT AGFYNGEAAM LVQGSWAIGQ INGNNPEMEP
KCGVFSFPAI EGGADPNRMI VKTDNLVMSS KTKNQDACIA LMKCFTDETA QKYTAEVGGK
IPIIKVDFDK EKAPAQLSYV MDILTKSTGT LGFYNESLAS VEAGDTFDNS MVDLFLGSIS
VNEAFQNVQD FYKEHVWKK
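
The sequences above are laid out in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the exact method the database used to compute its reported GC content is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed above.
first_line = "ATGTTAAAGA GGAAAATATC AGCACTAGTT CTTGTCGCAG CAATGACTTT TAGTTTACTA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Applying `clean_sequence` to the full gene sequence and then `gc_fraction` gives a value that can be compared against the GC content field.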