Gene Hoch_2366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2366 
Symbol 
ID8544752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3283120 
End bp3284268 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content68% 
IMG OID646387066 
Producthypothetical protein 
Protein accessionYP_003266797 
Protein GI262195588 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.18293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTCA ACCGTGTCGC CGCCGCGCTA CTGGGCGCGC TCGCCGTTTC GTTGGCCACC 
GCCCCGGCCT TCGCGCAGTC GCCGGCCGAG CGCGCCGACA AACTCAACAC CGAGGGCAAG
CAGCTCTGGG AGAAGAAGGG CGATGTCGCC GGCGCGGCCG AGAAGTTTCG CCAGGCGACC
GTGCTCAGCC CCGAGGGTCG CTTCTACTTC AACCTCTGCT ACGCGCTGCA CACCCTGCAG
CAGTATCGCG AAGCGCGCAC CGCGTGCCTG GCGGTCGAGC CCAACGGCGC CGAAGCCAAG
GTGGTCGAAA AAACCCAGAT CATCCTCGCC GACATCGACA AGATCGTGCC GCCCGAGCCG
GTAGACGAGC CCGTCGACCC GGTCGATCCC AACACCGACC CGGACTACGA TCCCAACGCC
GACCCGGACT ACGATCCCAA CGCCGACCCG AACTACGATC CCAACGCCGA CCCGAACTAC
GACCCCAACA CCGACCCCAA CGCCTACCCG CCGGACGGCC AGGGCCCGGG TTACGACAGC
GGGACCGGCG CGCCGCGGCC GCTGCCCGAG CTGGAAGAAG CCGCCGATCC CCTGGGCACC
TACTCCTGGT CGGTGGGCGT CGACCTCGGC CCCTCGGCGT TCTCCATCGG CGCCGAGGAC
TTCTACGTGT CCACGGGTGG CAGCTTCAAG GCCCACGGCA ACCTCGTGCT CATGCCCGAG
CGCCAGGCCG GCGTGCAAGG GTATCTCGGC ATCACCCCGG TGACGGCCGA CGTCGCCATC
GACGACCTGT TGATCGTGGA CCTCGGCGTG GCCGTGTTCA AGCACCTCAA CTTCGACCGC
CTGGTCTTCA CCCCGCTCGT GGGCCTGCAC CTGGCCGCGT TCCAGCCCTC GGACGGCGGC
GACGAAGCCT CGGGCGCGCT GGGTCTGCGC GGCGAGCTCG GCGTCTCCTG GCTGCTCGAC
AGCCGGGGCA AGCACGTGGT GAGCGTCACC CCGGCGTTCA ACCTCTACAG CGCTGGCTCA
GGCACCTCGT CCACCAGCGC CGAGTTCTAC GGACTCGACG AGCCCTCGGC CACCTTCGGC
ATCAGCCTGG GCTACACCTT GCGCTTCATG GAGCCCTTCG GCGACGGCCC GATCTTCGTG
CTCGAGTAA
 
Protein sequence
MRFNRVAAAL LGALAVSLAT APAFAQSPAE RADKLNTEGK QLWEKKGDVA GAAEKFRQAT 
VLSPEGRFYF NLCYALHTLQ QYREARTACL AVEPNGAEAK VVEKTQIILA DIDKIVPPEP
VDEPVDPVDP NTDPDYDPNA DPDYDPNADP NYDPNADPNY DPNTDPNAYP PDGQGPGYDS
GTGAPRPLPE LEEAADPLGT YSWSVGVDLG PSAFSIGAED FYVSTGGSFK AHGNLVLMPE
RQAGVQGYLG ITPVTADVAI DDLLIVDLGV AVFKHLNFDR LVFTPLVGLH LAAFQPSDGG
DEASGALGLR GELGVSWLLD SRGKHVVSVT PAFNLYSAGS GTSSTSAEFY GLDEPSATFG
ISLGYTLRFM EPFGDGPIFV LE