Gene CPS_4039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_4039 
Symbol 
ID3520447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp4246460 
End bp4247656 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content43% 
IMG OID637286485 
Productputative glutathione-independent formaldehyde dehydrogenase 
Protein accessionYP_270697 
Protein GI71279984 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR02819] formaldehyde dehydrogenase, glutathione-independent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCTCAA ATCATGGAAA CCGCGGCGTT GTTTATACTG GCCCTGGAAG TGTTGAAATT 
CAAGATATTG CCTACCCTAA GCTTGCCATT GGCAATCGAA AATGTGAGCA CGGTGTTATT
TTAAAAGTAG TAACTACCAA TATTTGTGGC AGTGATCAAC ATATGGTACG TGGCCGTACT
ACCGCTGAAC CTGGGCTTGT TTTAGGTCAT GAAATTACCG GTATGATCAT TGAAAAAGGC
AGCGATGTAG AGTTTTTAGA CATTGGTGAT ATTGTCTCTG TACCTTTTAA TATTGCTTGT
GGTCGCTGTA GAAACTGTCG CGAAGGAAAC ACAGGTATTT GCTTAAACGT TAATCCTGGT
CGCGCTGGTG CCGCCTTTGG TTACGTTGAT ATGGGCGGCT GGGTTGGTGG TCAATCTGAA
TATGTGATGG TGCCTTATGC TGACTTCAAC CTGCTAAAAT TTCCTGATAA AGATCAGGCA
TTAGAAAAAA TTCGTGACTT GACCATGCTC TCTGATATTT TCCCAACGGG ATATCATGGT
GCTGTAACTG CTGGTGTTGT TCCTGGTGCC ACAGTTTATA TTGCTGGTGC TGGACCTGTA
GGTCTTGCTG CTGCAGCTTC ATCACAATTA CTTGGCGCGG CTTGTGTCAT TGTAGGTGAT
ATGAATCCAG AGCGTCTAGC TCAAGCTCGT AGCTTTGGCT GTGAAACTAT CGATTTACGT
CAAGATGCCA CAGTGCCAGA TATGATAGAA CAAATATTAG GTGTTCCAGA AGTAGATGCT
GCTGTGGATT GTGTTGGTTT TGAAGCCCAC AGCCACGGTT GTAGTCATCA TAAAGAACAG
CCTGCAATTG TACTTAATAC TATGATGGAA GTAACACGTG CTGGGGGTGG TATCGGTATT
CCAGGGCTTT ATGTAACTGG CGATCCAGGT GCATCTACTG AAGCGGCTAA AACTGGCCAA
CTTAGTATGA ATTTTGGTCT TGGTTGGGCG AAATCACATT ATTTTGTTAC CGGTCAATGT
CCAGTAATGA AATATCATCG CAACCTAATG CAGGCTATTT TGTGGGATAA AGTTCAAATT
GCTAAAGCGG TTAACGTGAA AGTCATTTCA CTTGATGGTG CACCTGAAGG TTATAATGCC
TTTGATAAAG GCGCCGCACA AAAGTTTGTC ATTGACCCTC ATTCAATGGT GGTTTAA
 
Protein sequence
MCSNHGNRGV VYTGPGSVEI QDIAYPKLAI GNRKCEHGVI LKVVTTNICG SDQHMVRGRT 
TAEPGLVLGH EITGMIIEKG SDVEFLDIGD IVSVPFNIAC GRCRNCREGN TGICLNVNPG
RAGAAFGYVD MGGWVGGQSE YVMVPYADFN LLKFPDKDQA LEKIRDLTML SDIFPTGYHG
AVTAGVVPGA TVYIAGAGPV GLAAAASSQL LGAACVIVGD MNPERLAQAR SFGCETIDLR
QDATVPDMIE QILGVPEVDA AVDCVGFEAH SHGCSHHKEQ PAIVLNTMME VTRAGGGIGI
PGLYVTGDPG ASTEAAKTGQ LSMNFGLGWA KSHYFVTGQC PVMKYHRNLM QAILWDKVQI
AKAVNVKVIS LDGAPEGYNA FDKGAAQKFV IDPHSMVV