Gene Cphy_0822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0822 
Symbol 
ID5745302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1050966 
End bp1052156 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content33% 
IMG OID641291936 
ProductHAD family hydrolase 
Protein accessionYP_001557948 
Protein GI160878980 
COG category[J] Translation, ribosomal structure and biogenesis
[R] General function prediction only 
COG ID[COG0637] Predicted phosphatase/phosphohexomutase
[COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAG CGATATTGTT TGACATGGAT GGAGTTATTA TAGATAGTGA ACCTTTACAT 
TGTAAGGCGT TCCAGAAGGC TATGAAGCTG TTTGGTCTGG ATTTATCAAA AGAATATTGT
TACCAATTTA TCGGAAATAC AGATCGCTAT ATGGTAGATG TACTAGTTAA AGATTTTAAC
CTACCGAATA CTTCAGAAGA AGTAATTCGA ACAAAACAAG AAGTTCTAAA TCAGCTTGAG
TTAGAAGAAA GTTATCCTGC AGTTCCTTAT GTAGTTGATT TAATTAAAAA TCTTTCAAAA
CATCCAATTA AATTAGCTAT AGCAAGTTCC TCACCAATGG AACAGATTGA ACGAACTGCA
ATAGATTTAA ACCTTACCTC CTATTTTCAT GATTATGTTT CCGGAATGGA CTTAAAACAC
TCGAAACCAG CTCCGGATAT CTTTTTAAAA GCTGCTAGTT TGCTTGGTGT TTCACCAGAT
GAATGTTTGG TAATAGAGGA TTCGTATAAT GGTGTTACTG CGGCCAAAGC AGCAGGAATG
ACTTGTGTAG GTTATTATAA TGAGAATTCC GGGAATCAAG ATCTAAGTGG TGCAGATATA
ATTGTAGAGG GTTTTGAAGA GATTACATTT TCATTTTTAA ATAATGTATA TCTACGTTCT
CATGGAGAAC CTGTTACAAT TGCAACAACA GAACGTCTTA TCATTCGTGA ATTAAGTGTT
GATGATATCG TCTCTATGTA TCATATCTAT CAGCAACCGG AAGTCCGAGA ATTTGTTGAT
GATATCGATG ATTATCTACA AGAAGAAATC GAAAAACATA AAGCATATAT CAAAAATGTT
TATAATTTTT ATGGTTATGG ATTCTGGGGT ATCTTTAGTA GAGAAACAAG TGAATTAATC
GGCCGTTGTG GAATTCAGAA TTCTGAAATT AATGGCCGCT TTGAAATTGA ACTTGGATAT
TTACTTAATA TAGACCATTG GGGTTACGGA TACGCATTGG AATGTACAAA AAGTGTCTTA
GAATATGCTT TCTATGAACT TCATATTCCT CGCATTGTTG CTGTCATTGA CAAAAAGAAT
TCTCGTTCTA CAAAGGTTGC AATGCATGTT GGTATGAACT TAGAAGCAGA AATTTATCAT
AAAGGTAGAA ATTGTGATTT ATATGTCATT GAGAATCCAA ATATAGAATA A
 
Protein sequence
MLKAILFDMD GVIIDSEPLH CKAFQKAMKL FGLDLSKEYC YQFIGNTDRY MVDVLVKDFN 
LPNTSEEVIR TKQEVLNQLE LEESYPAVPY VVDLIKNLSK HPIKLAIASS SPMEQIERTA
IDLNLTSYFH DYVSGMDLKH SKPAPDIFLK AASLLGVSPD ECLVIEDSYN GVTAAKAAGM
TCVGYYNENS GNQDLSGADI IVEGFEEITF SFLNNVYLRS HGEPVTIATT ERLIIRELSV
DDIVSMYHIY QQPEVREFVD DIDDYLQEEI EKHKAYIKNV YNFYGYGFWG IFSRETSELI
GRCGIQNSEI NGRFEIELGY LLNIDHWGYG YALECTKSVL EYAFYELHIP RIVAVIDKKN
SRSTKVAMHV GMNLEAEIYH KGRNCDLYVI ENPNIE