Gene Hoch_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2036 
Symbol 
ID8544418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2809782 
End bp2812196 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content71% 
IMG OID646386739 
Producthypothetical protein 
Protein accessionYP_003266474 
Protein GI262195265 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.362684 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACA GCGGTTCTTT TCAACGAGGT CCGGGGGCGT CCGCGGACGC GGAGCGGGGC 
GCGCGGCAGC AGCAGGGCTC ACGGCCGACG CCGGGCAAGG TCACGCGCAC CAGCAAGCTC
AGGCCGGTGC AGGCCAAGCG CGGCGGCGCG GACGGTGGCG CGGCGGCGGC AGGCGCGGGC
GGCGGCGGCG CGGCGGGCGG CGGAGGCGGG TTGCCCGGCG GCCTCAAGAC CGGGATCGAG
TCGCTCTCGG GGGTGTCGAT GGACGGGGTC AAGGTGCACT ACAACTCGCC GCGTCCGGCG
CACGTGGGCG CGCTGGCCTA CGCCCAGGGC AGCGACATCC ACGTCGGCCC CGGGCAGGAG
CAGCATCTGC CGCACGAGGC CTGGCACGTG GTGCAGCAGG CGCAGGGGCG GGTGGCGAGC
ACCACGCAGA TGAAGGGCGT GGAGCTCAAC GACGACGCCG GGCTCGAGCG CGAGGCCGAC
GTCATGGGCG CCAAGGCGCT CGCGATGGGC GGCTCGGAGG CGGCCGCGAC CTCGGCCGGC
CCGGCGGCGA GCGGTCCGGT GCAGGCCGTC CGAAATCCCG ACTTCGGCGG CGTGGTGGTG
CAGCGCTTCG GCTCGCTCGA GCACAAGACC CTGGGCGATC GCCCCACCGG CAGCGCCGAA
TACGACATGG GCGGACACAC GGCCGAGAGC GACGCCGACG CGTACAACGC GGCCTTCCGG
CTCACCCACG GCGACATCAT CATGCTCTCG GGTGACTTCT TCTCGCCGCG CGACACGCGC
ATGAACGAGC AGGGCGTCGA GGAGCCCGAC CCCGACTCGC TGTTCCGCAT CGCGGGCACG
CCCTCGCGCT CGCCGGGGCA GACCGTGGGC TCGTGGGATG AGGTGGTGTA CGCCATCAAG
AAGGCCATGC CCAACGACGA TCGCTTTCAG CCGGCCTCGC TGCCGGGCTT TCCCGATGGT
CACCCGTGGG CGAGCGTGCG CTTCTCGGCC GACGTGATGG CCGCGGTCGA CGCGCGCTAT
CTGCGCCGCG CGGCCAGCAA CGACGAGCAC TTCGTGGCGC CCACCGGCAC GGATCGCGGA
CCCACGGCCG GCGACCGCGC CTCGGCCGGC GGCTCGTATC GCGCGCTGCA CGAGGTGGCC
ATCCAGATGG CCTACGACGA GGGCACTGCC GCCACCGCGA TGGCCCGCGA GGCAGCGGCC
CAGCACTTCC TCACCGATCA CTTCGCGTCC GGCCACCTGC GCACGCCGCG GACCTCGATC
CGGGCCCACT GGCAGGCGAT CTACCCGCTG TTCTGGGACA ATATCCGCAA CAAGATCGCG
CTCGATGTCG CCACCTGGAT CAACGACAAC GAGAACCTGG GCTATCTGGC CTCGGTGGAC
CAGATCTACA CCGACATCCA GACCCAGGTG GTGGAGGCGA CCGCCGATAT TCCGCCCATG
GGCTTCGACG ACCTGGTGTC TCTGGTGACC CACGACTTCG ACAACGAGAA CGGGCTGTGG
GTGACAAACG ACCTCGGTGA GTCGTGGAAG CTGTTCGGCG ACGGCAACCT CGACAGCGCC
GACCCCGACA ACCGCACCCG CGAGATGAGC GAGCTGGCCG TGAGCCTGGG CAACGCCGAC
ATCGAGGCGG CCGCGCAGAT GGGCGCGAAC AAGGGCGATA CTCCGCTCAG CGCCGCGGAG
CTGTTCGACT CGGTGCGCAG CATCACGGGC GCGCCGGCCT CGGCCGGGGT CAAGTACGGC
CCCGAGCAGG TGCTGCCGCG CCTCGACAGC GCCCGCGCGT CCGAAAACGG CAGCTCGAAC
TGGCAGCAGT CGAGCATGGA CGAGCTGTGG CAGGCGCAGG TGACCAGCGC CACCGCGGCC
ACCTACGGCT CGGAGATCAC GGCCAGCATG CAGGGCGGCG AGCTGCAGCA CGAGCTGGCC
GGCATGGCCG AGAAGTTCCC CGAGTCGCAG GACGTGATGG GCGTGTTCAC GGTGCGCCCG
CGCCAGGCCT TCCTCGACGG CTTTCTCGGA CCGCTGGTCG CCAACCCGCG CACCGGTCTG
CAGAGCATCC TCGATTTCTC GCCCTCGCGC GGCCAGGCCG GCTTCAACGA GGACGACGCG
GTCATGCGCG AGATCGAGGG CGAGGACGGC GCGGGCGGCA TGAGCGACGA GCAGCTCGGG
GGCCTGACCA TGAACCAGCG CGCCGAGCGC ATCCGCGCGC TCATCGGCGG CTGGACGGGC
GAGGACGAGG GCGAGGTGGT GATCCGCATC TTCGAGACCA CGCCCGCGGG CGATCGCGCC
CGGCTCTACG AGATGGTCGA GGGCCACCCG TGGACGGGCA ACTGGCGCGA GGGCTTCTTC
GTGGTCGACG ACGACATCTG GGACGCGCTG TACGAGTCGC AGCTCGAGCG GCTGCGCGAT
ATCATCGGCC ACTGA
 
Protein sequence
MSNSGSFQRG PGASADAERG ARQQQGSRPT PGKVTRTSKL RPVQAKRGGA DGGAAAAGAG 
GGGAAGGGGG LPGGLKTGIE SLSGVSMDGV KVHYNSPRPA HVGALAYAQG SDIHVGPGQE
QHLPHEAWHV VQQAQGRVAS TTQMKGVELN DDAGLEREAD VMGAKALAMG GSEAAATSAG
PAASGPVQAV RNPDFGGVVV QRFGSLEHKT LGDRPTGSAE YDMGGHTAES DADAYNAAFR
LTHGDIIMLS GDFFSPRDTR MNEQGVEEPD PDSLFRIAGT PSRSPGQTVG SWDEVVYAIK
KAMPNDDRFQ PASLPGFPDG HPWASVRFSA DVMAAVDARY LRRAASNDEH FVAPTGTDRG
PTAGDRASAG GSYRALHEVA IQMAYDEGTA ATAMAREAAA QHFLTDHFAS GHLRTPRTSI
RAHWQAIYPL FWDNIRNKIA LDVATWINDN ENLGYLASVD QIYTDIQTQV VEATADIPPM
GFDDLVSLVT HDFDNENGLW VTNDLGESWK LFGDGNLDSA DPDNRTREMS ELAVSLGNAD
IEAAAQMGAN KGDTPLSAAE LFDSVRSITG APASAGVKYG PEQVLPRLDS ARASENGSSN
WQQSSMDELW QAQVTSATAA TYGSEITASM QGGELQHELA GMAEKFPESQ DVMGVFTVRP
RQAFLDGFLG PLVANPRTGL QSILDFSPSR GQAGFNEDDA VMREIEGEDG AGGMSDEQLG
GLTMNQRAER IRALIGGWTG EDEGEVVIRI FETTPAGDRA RLYEMVEGHP WTGNWREGFF
VVDDDIWDAL YESQLERLRD IIGH