Gene Hoch_3229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3229 
Symbol 
ID8545617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4448348 
End bp4449574 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content66% 
IMG OID646387896 
Producthypothetical protein 
Protein accessionYP_003267624 
Protein GI262196415 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0914221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.650487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTCGT TCGACAGTCA TCGCCTCAGC CGCATCCTGC CCCTGCGGGT GCGCAATCAG 
CTCGACGCCT ACTGGGAGAT GCTCGGCCTG GTGCGCGAGA TCAACAACCC GCGGGTGCTG
CGCGCGCTCG GGCCCTCGGG GCTGCGCGGC CTGCTCTTGC GCCGCGGCAA GCAGAACGTG
CCCAGCAACT TCCAGGCCCG TCACAAGGCC CACTTCAACT GGTCGTATCC CAGCGACAAC
GGCGAGATGG CCGAGCTGTA CGCGCGCGCC AAGCAGGGCC AGTGGGACGG CGACAGCTAT
CTGCCCTGGG ACATCGATGT CGATCCCGAA AACCCCGAGC GCGCGATCAT TCCCGAGCAG
TTTTTCGCCT TCGAGCTGCT CGCCGAGTTC GGCGTACGAC TGTCCGAACG CGAGCGGCGG
CAACTGCTGC ACAGCATGGC GGCGTGGATG CTCAGCCAGT TCTTGCACGG CGAACAGGGC
GCGCTGATGG CGGCCGCCCA GGTCACCGAG GCGGTGCAGT TCTTCGACGG CAAGCTCTAC
GGCTCGACCC AGGTGGTCGA CGAGGGCCGC CACGTCGAGG TGTTTCACCG CTACCTCGAC
ACCAAGCTCG AAAAGCTCTA CCAGATCAAC GACAACCTGT TCGTCATCAT CGACGCGCTG
ATGGAGGACA GCCGCTGGGA CATGAAGTTC CTCGGCATGC AGATCATGGT CGAGGGGCTG
GCCCTGGGCG CTTTCGGCGT GCTGTACCAA AACACCCGCG AGCCGCTGCT CAAGGAGCTG
TTGCGCATGG TCATCCAGGA TGAGGCCCGG CACGTGCACT ACGGCGTGCT GGCGCTGCGC
GAGCATTTCC GCGAGGCGCT GAGCGAGCGC GAGCGCATCG AGCGCGAGGA CTGGGCCTTC
GAGGTGGCGC TGCTGATGCG CAATCGCTTC ATGGCCTACG AGGTGTACGA GGAGTGGTTC
GAGGGCAGCT TCAGCCGCGA CCAGTGGCGC CGCTTCGTGG CCGCGTCGCC CGGCTTCGAG
CAGTTCCGCC ACGTCATGTT CAACCGCCTG GTGCCCAATC TGCGCGAGAT CGGGCTGATG
TCGCCGCGCA TCCAGGAGCA CTACGCCGAG GTCGACCTGA TGAAATACTT CGGCAACGCG
GCCGCGGACC AGCTCTCGGG CGAGCAGCTC ATCCACGAGC TCGACGCCAG CGCGCCCAAG
GACACACTCG GCGTCGCCAC AGCGTAG
 
Protein sequence
MFSFDSHRLS RILPLRVRNQ LDAYWEMLGL VREINNPRVL RALGPSGLRG LLLRRGKQNV 
PSNFQARHKA HFNWSYPSDN GEMAELYARA KQGQWDGDSY LPWDIDVDPE NPERAIIPEQ
FFAFELLAEF GVRLSERERR QLLHSMAAWM LSQFLHGEQG ALMAAAQVTE AVQFFDGKLY
GSTQVVDEGR HVEVFHRYLD TKLEKLYQIN DNLFVIIDAL MEDSRWDMKF LGMQIMVEGL
ALGAFGVLYQ NTREPLLKEL LRMVIQDEAR HVHYGVLALR EHFREALSER ERIEREDWAF
EVALLMRNRF MAYEVYEEWF EGSFSRDQWR RFVAASPGFE QFRHVMFNRL VPNLREIGLM
SPRIQEHYAE VDLMKYFGNA AADQLSGEQL IHELDASAPK DTLGVATA