Gene Hoch_6540 details


Gene Information

Locus tag: Hoch_6540
Symbol:
ID: 8548957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 8974985
End bp: 8976364
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 70%
IMG OID: 646391203
Product: L-serine dehydratase 1
Protein accession: YP_003270902
Protein GI: 262199693
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1760] L-serine deaminase
TIGRFAM ID: [TIGR00720] L-serine dehydratase, iron-sulfur-dependent, single chain form
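
The coordinate and length fields above can be cross-checked with a little arithmetic. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 8974985
end_bp = 8976364

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1380 459
```

Both results agree with the listed gene length (1380 bp) and protein length (459 aa).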


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00743621
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCATCA GCATTTTCGA CATGTTCACC GTCGGCATCG GCCCCTCGAG CTCGCACACG 
GTCGGCCCCA TGCGCGCGGC TCGGCGTTTC GTCGCGCACC TGGAGGCGCG CGGAACCCTG
GCGGCGACGC GCTCGCTGCG GGTCGAGTTA TTCGGCTCGC TGGGTCACAC CGGCAAGGGC
CACGGCACCG ATCGCGCGGT GCTCATGGGC CTGGAGGGCG AGGATCCCGA GACCGTGGAT
CCCGGCACCA TCGGCCAGCG CGTGGCCGCG ATCGAGGAGC GCCAGCGGCT GACGCTCAGC
GGCGGCCACG AGATCGACAT GGTGCCGGCG CGGCATCTGG TGTGGCATCG CAAGAAGATC
CTGCCCGTGC ACTCCAACGG GCTGCGCTTC GCGGCCCTGG GCGACGACGG CCTGGAGCTG
AGCAGCCGTA TCTACTACTC GGTGGGCGGC GGCTTCGTGG TCCCGGGCGA CACCGACGTC
GAGGCCCCGT TCATCGCGCC GCGTCCGAGC GCGCCGTATC CGTTCACCAG CGGCGACGAG
TTGCTGGCGC AGTGCGCCGA GCACGGCATC TGCATCAGCA CCTTGATGCT CGAGAACGAA
AAGGCCTGGC GCAGCGAGGA CGAGGTGCGC GGCGAGCTGT TGCGCATCTG GGGCGTGATG
CAGGCGTGTA TCGAGCGCGG CTGCCGCAGC GAGGGCATCC TGCCCGGCGG GCTCAAGGTC
AAGCGCCGGG CCGCCGCCAT CCATCGCAAG CTGCGCGCCG AGGCCAGCAG CGTCGGCAAC
AGCGCGCTGG TGCTCGACTG GGTCAACCTG TTCGCGATCG CGGTCAACGA GGAGAACGCG
GCCGGCGGGC GCGTGGTCAC GGCGCCGACC AATGGCGCGG CCGGCGTCAT CCCGGCGGTG
CTGTCGTATT TCGTGCGCTT CTGCGGCAGC GACGCAACCG AGGAGGGCGT GGTGCGCTTT
CTGCTCACGG CCGGCGCCAT GGCCATTCTG TACAAGATCA ACGCGTCGAT CTCGGGCGCC
GAGGTCGGCT GTCAGGGCGA GGTCGGCGTG GCCTGCTCGA TGGCGGCCGC GGGCCTGGCC
GAGGTCCTCG GCGGCAGCCC CGAGCAGGTC GAGAACGCGG CCGAGATCGG CATGGAGCAC
AACCTCGGCC TCACCTGCGA TCCCGTCGGC GGGCTGGTGC AGGTGCCGTG TATCGAGCGC
AACGCCATGG GCGCGGTCAA GGCCATCAAC GCGGCGCGTC TGGCGCTGCG CGGCGACGGC
AAGCACACGG TGTCGCTCGA CAAGGTCATC CGCACCATGC GCCAGACCGG CGCCGACATG
AAGGCCAAGT ACAAAGAGAC CGCGCGCGGC GGCCTCGCGG TCAACATCGT CGAGTGCTAG
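
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The following is a rough sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as a sample.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = "ATGTCCATCA GCATTTTCGA CATGTTCACC GTCGGCATCG GCCCCTCGAG CTCGCACACG"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to all 23 lines of the sequence, the same functions give values that can be compared with the listed gene length and GC content; a single line will not necessarily match the gene-wide 70% figure.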
 
Protein sequence
MSISIFDMFT VGIGPSSSHT VGPMRAARRF VAHLEARGTL AATRSLRVEL FGSLGHTGKG 
HGTDRAVLMG LEGEDPETVD PGTIGQRVAA IEERQRLTLS GGHEIDMVPA RHLVWHRKKI
LPVHSNGLRF AALGDDGLEL SSRIYYSVGG GFVVPGDTDV EAPFIAPRPS APYPFTSGDE
LLAQCAEHGI CISTLMLENE KAWRSEDEVR GELLRIWGVM QACIERGCRS EGILPGGLKV
KRRAAAIHRK LRAEASSVGN SALVLDWVNL FAIAVNEENA AGGRVVTAPT NGAAGVIPAV
LSYFVRFCGS DATEEGVVRF LLTAGAMAIL YKINASISGA EVGCQGEVGV ACSMAAAGLA
EVLGGSPEQV ENAAEIGMEH NLGLTCDPVG GLVQVPCIER NAMGAVKAIN AARLALRGDG
KHTVSLDKVI RTMRQTGADM KAKYKETARG GLAVNIVEC
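
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The sketch below assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ in their permitted start codons); `translate` is a hypothetical helper, and it raises `KeyError` on codons containing ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCCATCA GCATTTTCGA CATGTTCACC"))  # MSISIFDMFT
```

The output matches the first 10 residues of the protein sequence. The last three codons of the gene, `GAG TGC TAG`, give `EC` followed by a stop, which matches the C-terminal end of the protein.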