Gene Cphy_1919 details

Gene Information

Locus tag: Cphy_1919
Symbol:
ID: 5744598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2369079
End bp: 2370116
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 36%
IMG OID: 641293016
Product: glycosyl hydrolase family protein
Protein accession: YP_001559027
Protein GI: 160880059
COG category: [R] General function prediction only
COG ID: [COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins
TIGRFAM ID:
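
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2369079
END_BP = 2370116
GENE_LENGTH_BP = 1038
PROTEIN_LENGTH_AA = 345


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: 2370116 - 2369079 + 1 = 1038 bp, and 1038 / 3 - 1 = 345 aa.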


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAAAA TTGAGTATGA CAGAGAAAAT ATATTAGAAG TAATTGACCG TGTTGTGAAA 
AAAACGATGA CAATGGATAT GACATGGGAT TGGCCTTGTG GAGTTGCTTA TTATGGAATA
AGTGAGGCTT ATCGCGTTAC AAAAAATGAA GAATATATCA ATCTATTAAA ACAATGGACA
GATGAATACA TTGCACTTGG TTTACCTAAT TGGACAGTAA ATACCTGTGC AATGGGACAC
TGTATGATTA CATTATATGA AGCAACTAAT GATGAAAAAT ATTGGGATAT TGTCATGAGC
AAAATTGATT ATATTCGTAA TAAAGCGTTA CGTTTTGGCG ATAATGTACT TCAACATACC
GTATCCGTTA ATAATGATTT TCCAGAACAA GCATGGGCAG ATACTTTATT TATGGCAGCA
TTTTTCTTGT TGCGTGTAGG CGTGAAATTA AAGGATCCTG ATATGATCGA TGATGCACTT
AATCAGTATT ATTGGCATAT TCAATATTTA CAAGAGCCAA GAACAAGTCT TTGGTATCAT
GGTTATAACA ATATCAATAA AGATCATATG TCAGGATTTT ATTGGGGACG TGCCAATGCT
TGGGCAGCTT ATACAATGTC ACAGGTTAGT AAAAGACTTC CTGAACCATA TTTATATCCA
AACTATATGG AAATTGACTG CGCACTACGT GACCAACTAG CAGCTCTTAA ACTCTTACAG
ACAAAAGATG GCTTATGGAG AACAATTCTT GATGATGAAG AGTCCTACGA AGAAGTATCT
GCAAGCTGTG GAATCGCAGC AGCAATGGTA ACAAACCAAA ATCCACTTCA CACCAAATAT
ATTCAAAAAG CACTTGATGG TATTCTTAAA AACATATCAG AAGATGGTAG AGTACTTAAT
GTTTCTGGCG GAACAGCAGT AATGAAAGAC AGAGAAGGTT ACCGTAATGT ACCTAAGACC
TGGATGCAGG GATGGGGTCA AGGTTTAGCA CTTTCGTTCT TATCCGCACT TGTGGACGAT
AGCCAGAAAT TATTTTAG
 
Protein sequence
MLKIEYDREN ILEVIDRVVK KTMTMDMTWD WPCGVAYYGI SEAYRVTKNE EYINLLKQWT 
DEYIALGLPN WTVNTCAMGH CMITLYEATN DEKYWDIVMS KIDYIRNKAL RFGDNVLQHT
VSVNNDFPEQ AWADTLFMAA FFLLRVGVKL KDPDMIDDAL NQYYWHIQYL QEPRTSLWYH
GYNNINKDHM SGFYWGRANA WAAYTMSQVS KRLPEPYLYP NYMEIDCALR DQLAALKLLQ
TKDGLWRTIL DDEESYEEVS ASCGIAAAMV TNQNPLHTKY IQKALDGILK NISEDGRVLN
VSGGTAVMKD REGYRNVPKT WMQGWGQGLA LSFLSALVDD SQKLF
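
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. The sketch below is a plain-Python illustration, not the tool used to produce this record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as starts). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and
# translation table 11, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three groups of the gene sequence listed above.
    first_30_nt = "ATGTTAAAAA TTGAGTATGA CAGAGAAAAT"
    print(translate(first_30_nt))  # MLKIEYDREN
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` can be compared against the GC content field in the same way.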