Gene Hoch_1734 details

Gene Information

Locus tag: Hoch_1734
Symbol:
ID: 8544116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2361890
End bp: 2363737
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 64%
IMG OID: 646386441
Product: hypothetical protein
Protein accession: YP_003266176
Protein GI: 262194967
COG category:
COG ID:
TIGRFAM ID:
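
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The helper names span_length and expected_protein_length are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2361890
end_bp = 2363737
gene_length_bp = 1848
protein_length_aa = 615


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks agree with the record: 2363737 - 2361890 + 1 = 1848 bp, and 1848 / 3 - 1 = 615 aa.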


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00167071
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAAT GGAGATGGTT GTGTATGTTC GCGCTCGCGA TGCCGCTGTG GGCGTCGGGT 
TGCGCCTGGC TGGTTCTCGA GGAGCCGGCG TGCACTTCGG GCGAGACCTG TCCGGGCGAC
TTCGTGTGCG ACCCGGCCAC CGGCCTGTGC CTCACCCCGT GTCCGCCCGG CAGCGAGCTG
TGCGACGGCA TGTGCGTCAA TCCGCAGCAG AACAACAGCC ACTGCGGCGC CTGCGGCGAG
GTCTGCGGCG ACGGCGAAGT GTGCGCAGCC GGCGAGTGCG CGGCCACCTG TGGGCCCGGC
ATCAGCACCT GCGAGGGCGA GTGCGTGGAC ACCGACATCG ATCCCGCGCA CTGCGGCGCC
TGCGGCAACG CCTGCGCGGC CGACGGCTAC TGCGTGGGCG GCGCGTGCGC GGCCACCTGC
GCGCCCGGCA CCGAGCCCTG CGGCGGCATC TGCAGGGACA CCGCCAGCGA CCCGGACCAC
TGCGGCGGCT GCGGCCTGAG CTGCGGCGAG GGCGAGCTGT GCGAGGCTGG CGCGTGCACC
TGCCCGAGCG GACTCGACGA GTGCGCGGGC ACCTGCATCG ACTTCGACAG CGAGCCCGCG
CACTGCGGCG CCTGCGGCAA CGCGTGCTTG GACGGCCAGA ACTGCGTCGA TGGCGCGTGC
GCGTGCCCGG CCGGGACCAC CCTGTGCGGC GGCGCGTGCG TCAACCTCGA CACCGACGCC
GAGCACTGCG GCGTGTGCGA AAACCAGTGC AGCAGCGGCA GCGCGTGCAG CGGCGGCGCG
TGCGTGTGCG TTCCCGGACA GACGTACTGC GACGGCGCGT GCACCGACCT CGGCGACGAT
GACGCGCACT GCGGCGCTTG CGGCAACGCC TGCGGCGGCG GCATGAGCTG CGAGGGCGGC
GCGTGCGAGT GCCCGGCCGG CTTCACCGAG TGCGGCGGCG CGTGCAAAGA CCTCGAGCGC
GACGTGCTCT CGTGCGGCTC GTGCGGTCAC GCTTGCGACT CGGGCAAGGC GTGTGTCGAT
GGTCAGTGCG TGTCGCCGTT TACCGGTACT CGGACAGCGC TGCCACTGCC CAACGGTTGC
TCGGCCTTTC CGCTATCGCC CGACGAGTGC GGCACGCCGA TCGCCACGGA GACAGTGCCG
TTCGAATTCG ACGACCTTGA TGAGACAATT ATATATAATC CCGATGAGCC ATTTGCTAGC
AACGAATCTC TCGTTTTATC GGGCGATGCG CGGTCGTTTG ATCATGGTGG TGCCCTAGGG
TACGTTGGGT TTTTCGATGA GGGGGAGCAT TCACTCACGC AAAGGAGCAT TCTCGCAGGG
TCTAGCTGGG GAGCTTTCTC TCTCGGTATC CGCTCCGGTA TTCGATCCTG TGCGCGCCCT
TCGAGTCTCC GGTTCTGGGT TAGATTCCCT GTTCTAGATA TTGTCGCTAC TCTCGTTCGT
ACTACCCTTA CAGGCAAATA TAACACCGGT GGAATAGGGT TTTTCGAGGC CACATCTTTA
CAGATAAATC CAGAAACTGG TGAAACCTGT GACCAAGTGT GCGGCTATAT CTCAATGCGA
TGCGATGACA CGCAGCAGTG GTACAGGGCG ACGATTCCCC CTCGTAAGAC CCTCGCTCTG
GAGATCGCCT TACGCAGTAG TGGTAGTACC TACTTCAATG TCGCCGTATT TCGAACAGAC
GAATCACAGA TTTTCCATGC GGCTAGTGGG AGCGTGGGCG GCTCCTACGG ATTCTACACT
GCGAGGCTCA GGAACAACCT CGACATTCCC CAGGACGTCG TATTGAGCGT CATCCCATGG
AACGGAGATG GTGTGCACTA TCAAATCGCG GCAGCGATAG AGCAGTAA
 
Protein sequence
MIKWRWLCMF ALAMPLWASG CAWLVLEEPA CTSGETCPGD FVCDPATGLC LTPCPPGSEL 
CDGMCVNPQQ NNSHCGACGE VCGDGEVCAA GECAATCGPG ISTCEGECVD TDIDPAHCGA
CGNACAADGY CVGGACAATC APGTEPCGGI CRDTASDPDH CGGCGLSCGE GELCEAGACT
CPSGLDECAG TCIDFDSEPA HCGACGNACL DGQNCVDGAC ACPAGTTLCG GACVNLDTDA
EHCGVCENQC SSGSACSGGA CVCVPGQTYC DGACTDLGDD DAHCGACGNA CGGGMSCEGG
ACECPAGFTE CGGACKDLER DVLSCGSCGH ACDSGKACVD GQCVSPFTGT RTALPLPNGC
SAFPLSPDEC GTPIATETVP FEFDDLDETI IYNPDEPFAS NESLVLSGDA RSFDHGGALG
YVGFFDEGEH SLTQRSILAG SSWGAFSLGI RSGIRSCARP SSLRFWVRFP VLDIVATLVR
TTLTGKYNTG GIGFFEATSL QINPETGETC DQVCGYISMR CDDTQQWYRA TIPPRKTLAL
EIALRSSGST YFNVAVFRTD ESQIFHAASG SVGGSYGFYT ARLRNNLDIP QDVVLSVIPW
NGDGVHYQIA AAIEQ