Gene Cphy_1125 details

Gene Information

Locus tag: Cphy_1125
Symbol:
ID: 5741960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 1418251
End bp: 1419804
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 38%
IMG OID: 641292230
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001558242
Protein GI: 160879274
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
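
The coordinate and length fields above are tied together by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state but that its numbers are consistent with: Start bp and End bp are 1-based inclusive coordinates, and the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1418251, 1419804)
print(length)                     # 1554
print(protein_length_aa(length))  # 517
```

Both results match the Gene Length and Protein Length fields of the record.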


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTTAT CAATAAGAGG AAAAGTTGGA CAACGAATTG TAGCAGGATT TCCTGGTACT 
ACGATCGATA GTGAATTAGA GGATTTTATC CGTACCTATA AGATTGGGAA TTTCATTTTA
TTTAAAGAAA ATATTGTAGA TGCGAATCAG TTAAGTAATC TATGTGAGGG ATTACAGCAG
CTTACAAAAA AATATACCGG ACATAGAGCA TTTATAACGA TAGATCAAGA GGGTGGAATG
GTAACAAGAC TATCCGAAGA TAGTGTTAAT ATTCCTGGTG CGATGGCGAT TGCTGCAACT
AGGGATGAAA AAAATGCTTA TATGGCAGGA AGGATTACTG GTCAGCAATT GAGAACCTTA
GGGTTTAACT TTGATTTAGC ACCTGTGGCA GATATCAATT CCAATATGGA TAATCCGGTC
ATTGGGGTAA GAAGTTATGG TGATGAACCA GATCAGGTAG CAAAATACTG CGTAGCCATG
ATGAAAGGTC TTACCGATGG AGGGGTGCTG GCTTCAGCGA AGCATTTCCC AGGGCATGGA
GATACCAATG TAGATTCTCA TCTTGGGTTA CCGAAGGTAC ATAAGTCATT AGAGGAGATG
GAGCTGTGCG AATTAGTTTC ATTTAAAGCA TTGATTGAGG CAGGAATACC GGCAATCATG
TCATCCCATA TCATTTTCCC AGCCTTAGAA GAGGAACTTC CAGCAACGAT GTCAAGAAAG
ATTATCACAG GACTCTTAAA GGAGAAGCTT GGATTTAAAG GGTTAGTTAT TAGTGACTGT
ATGGAAATGA GTGCGATTAA GAAATACTAT GGATCAATTG AGGGAATCAA GCATGCGATT
GAGGCAGGTG TTGATTTAAT CTTTGTATCT CATACCATGA GTGTCGCAAG GGAAGCCTCA
GATGTATTGA CAGGCTTGTA TGAAAAAGGC GAATTATCTA TGGATGAGAT GGATGCATCT
ATTGATAAAA TTATGTACTA TAAAGATAAA TGTTTATGCA ATGAGAACGA AAAGCATGAT
ACCAATGAAT TTGATGTGAA GGCTGGAATT GAATTTACAA AAGAGCTTCT TCGAAAGAGT
TTAACCCCAA TTCAAATGCC TAGCGATAAC CTACCAGTTG TTGATCATAA TTCCCTATTT
CTTGGATGTA TGCCATTTAG AGCTACGAAT GTTTTCAATA TAGATGCAGG TGCCTTTCAA
TTTGCAGATT ATATGGCTAA GTATTTTAAT GGAAATGGTA TTTTAACATC ACCACAACCT
ACAGACGAAG AGATGGAAGC GTTAATACAA CCAATGAAAG AAGCAAGTAC TGTAGTCATC
GCAACGTATA ATGCACATCT ATATAAAGAA CAACTAAAAC TCGTTGAACT TGCAGCAAAA
TCCAATACAA ACGTTATTGT TTTTGCTCTT AGAAATCCAT ATGATTTAAA AGACTTACCA
GCAAATGTGT ATGGAATTGC TGTCTATGAG TATACCTTAA AGAGTGTAGA GGCATTGGCA
GAATATATGA AACAGCCATA TGAGTTGAGT GGAAAATTAC CTGTGAAGAT GTAA
 
Protein sequence
MDLSIRGKVG QRIVAGFPGT TIDSELEDFI RTYKIGNFIL FKENIVDANQ LSNLCEGLQQ 
LTKKYTGHRA FITIDQEGGM VTRLSEDSVN IPGAMAIAAT RDEKNAYMAG RITGQQLRTL
GFNFDLAPVA DINSNMDNPV IGVRSYGDEP DQVAKYCVAM MKGLTDGGVL ASAKHFPGHG
DTNVDSHLGL PKVHKSLEEM ELCELVSFKA LIEAGIPAIM SSHIIFPALE EELPATMSRK
IITGLLKEKL GFKGLVISDC MEMSAIKKYY GSIEGIKHAI EAGVDLIFVS HTMSVAREAS
DVLTGLYEKG ELSMDEMDAS IDKIMYYKDK CLCNENEKHD TNEFDVKAGI EFTKELLRKS
LTPIQMPSDN LPVVDHNSLF LGCMPFRATN VFNIDAGAFQ FADYMAKYFN GNGILTSPQP
TDEEMEALIQ PMKEASTVVI ATYNAHLYKE QLKLVELAAK SNTNVIVFAL RNPYDLKDLP
ANVYGIAVYE YTLKSVEALA EYMKQPYELS GKLPVKM
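
The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the database's own method: it strips the 10-base spacing used in the listing, translates codon by codon until the first stop, and computes a GC fraction. It assumes the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in the start codons it permits). The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Only A, C, G, T are handled; any other symbol raises KeyError.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from the record.
first_line = "ATGGACTTAT CAATAAGAGG AAAAGTTGGA CAACGAATTG TAGCAGGATT TCCTGGTACT"
print(translate(first_line))  # MDLSIRGKVGQRIVAGFPGT
```

The printed residues agree with the first 20 residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` gives a way to compare against the Protein sequence and GC content fields.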