Gene Cphy_0529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0529 
Symbol 
ID5743443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp671066 
End bp672460 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content35% 
IMG OID641291641 
Productextracellular solute-binding protein 
Protein accessionYP_001557655 
Protein GI160878687 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.361956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACGAT TTTCAAAGGC ATTAGCACTG ATGCTAGGTA TCACATTAGT TTTCACTGGC 
TGTGGTGCTA AAAACAGTAA CAGTGGTGAT GGCAGTAAAG GCAACGAAGG AAATACCAAT
AACACTTCAA CAGTAAAACC AACAGAACCT GCAAAGACAG ATTCAGGAAA GCAAAACACC
ATTGTAGTAT ACTCATGGGA AGCATCTTTA AAAGAGCAAA ATGATAAGGT AATTGCAGCA
TTCGAAAGTA AATATCCTAA CATTAAGGTT GATATGCAAT ATCCTGTTGA GAATGATAAT
GTGAAGTATA CGGAGAAAGT GGATTTGCTT CTTCTTTCTG GTGAGAAAGT GGATGCAGTA
TTAGAGTCTT CTGTAGCAAA ATGTGTAAGT AAAGTACAAA GAAATTTGTA TCAACCATTG
GATCAGTTTA TACAAGCAGA GGGAATTAAC TATGACGACG TTTATAGCGT AAACTCTCAG
GTAGATGGAA GCTATTATGC TTGTCCAATT GACGTCACAC CTTGGTTTAT CATGATGAAC
AAAGATATGT TAGATAAAGC TGGACTTCCT GTACCAACAA GTTGGACTTG GGATGATTAC
AGAGAGTATG CGAAAAAGCT TACAAGTGGT AGTGGTCTTG ATAAGATTTA TGGATCCTAT
TTCCACACAT GGCAGAACTA TGGCTTAATG GGTGTTTATT CAACAAAGAT GGATAATGCT
TATTATAAAG CTGATGGATC TTTAAATTTT GATGATCCTA ACTTAAGAGA TTGGCTAGAG
TTTAGATATG AAATGGAAAA TGTAGATGAA ACTTCTGTGC CACTTATTGA TATCAAAACA
TCAAATTTAG CTTATAGAAA TGAATTTTTT GGTGAAAAAG TTGCTATGCT TCCAACTGGA
ACATGGATGT TAGCAGAAAT TAAAGATGCT GAAAAATGGC CACATAATTT TAAAACTTGC
TTTGCTCCAC TTCCAACCTG GGATAATGGT GCAGAAGGCC GTACTTTCTC CGATACAAAA
ATGTTCAGTA TTCCAAAATC ATCAAAATAT CCAGAAGATG CATATAAATT CATCAGATTC
TATACTACAG AGGGTGCTTA TATTCGTGCA GGTGGTTTAA CAGCAGAAAA GAACATGAAT
CTTGATACAA TTTTACCATT CATAGTTGGT GAGAATCCAG ATGCATTATA TGATATGGAT
TCCGTGAAAA ATGTTTTAAA TAATCCTAAA CTTGAGATGA ATGCTCCAAT GACAGCTCCT
GGCTACAATG CTGAGATCGA TTCCCTATTT GTAGAAGAAA TCGAAAAGTA TTTAGTAGGT
GGAGAAACAT TAGATGATTG TATCAGCAAT CTTAACGAAA GAGGAAAATA TATAGTGGAG
AGCTTTGTTG AATAA
 
Protein sequence
MRRFSKALAL MLGITLVFTG CGAKNSNSGD GSKGNEGNTN NTSTVKPTEP AKTDSGKQNT 
IVVYSWEASL KEQNDKVIAA FESKYPNIKV DMQYPVENDN VKYTEKVDLL LLSGEKVDAV
LESSVAKCVS KVQRNLYQPL DQFIQAEGIN YDDVYSVNSQ VDGSYYACPI DVTPWFIMMN
KDMLDKAGLP VPTSWTWDDY REYAKKLTSG SGLDKIYGSY FHTWQNYGLM GVYSTKMDNA
YYKADGSLNF DDPNLRDWLE FRYEMENVDE TSVPLIDIKT SNLAYRNEFF GEKVAMLPTG
TWMLAEIKDA EKWPHNFKTC FAPLPTWDNG AEGRTFSDTK MFSIPKSSKY PEDAYKFIRF
YTTEGAYIRA GGLTAEKNMN LDTILPFIVG ENPDALYDMD SVKNVLNNPK LEMNAPMTAP
GYNAEIDSLF VEEIEKYLVG GETLDDCISN LNERGKYIVE SFVE