Gene Hoch_3444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3444 
Symbol 
ID8545832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4756532 
End bp4757611 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content73% 
IMG OID646388111 
ProductThreonine aldolase 
Protein accessionYP_003267839 
Protein GI262196630 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0251232 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTG TCATCGATCT TCGCTCCGAC ACGGTCACCC AGCCCACGGC CGAGATGCGC 
CGCGCCATGG CCGAGGCCGT GGTCGGCGAC GACGTCTACG GCGAGGACCC CACGGTCAAT
CAGCTCCAGG AGCGAGTCGC CGCGCTCCTG GGCACCGAGG CGGCCCTGTT CGTGCCCTCG
GGCAGCATGG CCAACCAGAT CGCCATCAAG GTGCACACCC AGCCCGGCGA CAGCGTCATG
GTCGGCGCCC ACGCCCACAA CTGGCTGTTC GAAGCCGGCG GCGCCGGCGC GATCTCGTCG
GTCCAGGTCG ACGTCCTGCC CGGCGACGGC CGCTTTGACG CCGCCGCCAT GCGCGAGTCC
TACAAGCCCG ACAATCACCT GTTCGCGCCC ACGCGCCTGG TCTCGGTCGA GAACACGCAC
AACATGGGCG GCGGCCTGGT GTGGGACGAC GAGCCTCTGG CCGCGGTGCT GGCGTGTGCG
CGCGAGCTCG AGCTGGGCAC GCACCTCGAC GGCGCCCGGC TGTGGAACGC GGCCGCGCGC
ACCGGCCGCT CCGAGGCCGA GCTGACCGCC GGCTTCGACA CCATCGCGGT GTGCCTGTCC
AAGGGCCTGG GCGCGCCCGT GGGCTCGCTG CTGTGCGGCA CCCGCGCGCT GGTCCACAAG
GGTCACCGGG TGCGCAAGAT GCTCGGCGGC GGCATGCGCC AGGCCGGCAT CCTGGCCGCG
GCCGGGCTGT ACGCGCTCGA GCATCACCGC CCGGGCCTGA CCCAGGACCA CGACAACGCC
CACTACCTGG CCGCGGAGCT GGCCGCGGTG CCCGGCTTCG CGGTCGATGT CGCGCGCGTG
CACACCAACA TCGTCATGGT CGACGTCGTC GACAGCGCGC TCGACGCCCA GCGCATCGCG
GCCGCCGCGG CCGAGCGCGG CGTGCGCGTC CACGGCATGT CGCCGCGGCG CATGCGCCTG
GTCACGCACC GCGAGCTCGA CCGCGCCATG TGCACGCGCG CGATCGAGAC CCTGGCCGCG
CTGGCCGGCG CTCCCGGCTC GGCATCGTCG AACGCGGCGA CGCGCGCCGC CCATGGCTGA
 
Protein sequence
MTTVIDLRSD TVTQPTAEMR RAMAEAVVGD DVYGEDPTVN QLQERVAALL GTEAALFVPS 
GSMANQIAIK VHTQPGDSVM VGAHAHNWLF EAGGAGAISS VQVDVLPGDG RFDAAAMRES
YKPDNHLFAP TRLVSVENTH NMGGGLVWDD EPLAAVLACA RELELGTHLD GARLWNAAAR
TGRSEAELTA GFDTIAVCLS KGLGAPVGSL LCGTRALVHK GHRVRKMLGG GMRQAGILAA
AGLYALEHHR PGLTQDHDNA HYLAAELAAV PGFAVDVARV HTNIVMVDVV DSALDAQRIA
AAAAERGVRV HGMSPRRMRL VTHRELDRAM CTRAIETLAA LAGAPGSASS NAATRAAHG