Gene Hoch_3550 details

Gene Information

Locus tag: Hoch_3550
Symbol:
ID: 8545940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4887178
End bp: 4888620
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 70%
IMG OID: 646388219
Product: protein of unknown function DUF147
Protein accession: YP_003267945
Protein GI: 262196736
COG category: [S] Function unknown
COG ID: [COG1624] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00159] conserved hypothetical protein TIGR00159
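
The start, end, gene length and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 4887178
end_bp = 4888620
gene_length_bp = 1443
protein_length_aa = 480

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length)     # 1443
print(computed_protein_length)  # 480
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.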


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000131066
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0716856
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTAGAGG ACCTCTTCAG CACGCCGTGG CGCGCGGTGA TCACGTGCCT CGACATCCTG 
ATCATCTACT ACGTGATCTA CCGCGTGCTG CTCACCATCA AGGGCACGCG CGCGGCGCAG
ATGGTCATCG CCATCGTGCT GATCTGGGCC TCCTCGCAGG TGGCCGAGCG GCTCGAGATG
ACCATCGTCT CGTGGCTGCT CGATAATTTC GTCAACTACT TCATCATCAT CATCATCGTC
CTGTTCCAGC AGGACATCCG CCGCGCGCTG ATGCGCATCG GCCAGAACGT GGTGCCCTTT
GGCAAGACCC ACCAGCTCTC GCACGCGCTC GACGAGGTGC TGGCCGCGGC CCAGCACCTG
GCCCGTGCCC GTCTGGGCGG CATCGTGGTG TTCGAGCGCG AGGCCGACGT GCGCCAGTTC
GTCGATGCCG GACGCGTGGT CGACGCCCAG GTGTCCAAAG AGCTGCTCGT GGCCCTGTTC
GTGCCCTCGC GCGACAACGA GCTGCACGAC GGCGCGGTCA TCATCGGCAA GAGCATGCGC
ATCGATCGCG CCGGCATCCT GCTGCCGCTG TCGCGCAACA ACACGCTGAG CCAGGAGTAC
GGCACGCGCC ATCGCGCCGC GCTGGGCATC ACCGAGGAGA CCGACGCCGT GGTCCTGGTC
ATCTCCGAGG AGCGCGGCGA GATCTCGCTG TGCTTCCGCG GCAACATCGC CCGCGATCTA
GAGATCGACA CCCTGCGCGA GCGTCTGCAG GCGCTGTTCG ACGCCGAGGC CACCGACAAG
ATCGCCAGCG AAGCCGAGGC CGCTGCCAGC ATCGGTCGCG CCATGGCCGC GCTGGCCGCC
ACCGAGAGCG CCGGCAGCGA CGACACCAAG CCGCCGCCCA GCCGCTCGAC CTCGCGGGTC
CCGGTGACCC CGGTGGCGAC GGCGAGCGAG CGCACCGAGC GCCACGTCCA GCCCACCGAC
CGACTCGGCG TGGTCGAGCG CGCGGCTTCG AACAGCGACA CCTCGTCGCA GAACGCGGGC
AAGCCCAGCG CGGTCCCCAG CGCGCCCCGC GCGATCCCGG GTGAGGGCAG CGAGGTCCAG
ACCTCGCGCA TGAGTCAGCC GCCGGCCGCG GCGCGGCCGC AACGAGCGGC GGCCGCGGCG
GCGTCCACCC CGGCGGCGGG TGGCGGCGCG GCGGCCCCCG AGCGCTTCGC CAAGAAGAGC
GCCGAGCTGG TGCCGACCAG CGACGCCGCC AGCACCGGCT CGCTGTCCAC CGAGCGCTTC
GCCAAGAAGA GCGCCGACGT GGTGCCGACG AGCGACAAGT ACGCCAAGAC CAGCGAGAAG
ATCGCGCCCA CCAGCGAGCG GCTCGGCAAG ACCACCAGCG AGCGCAGCGA CAAGCACGTG
CAGCCGGGCG AGAGCCGCGA CCGCCGCACG GGCGAACACG TACTGCCGGA GCGCAATCCA
TGA
 
Protein sequence
MLEDLFSTPW RAVITCLDIL IIYYVIYRVL LTIKGTRAAQ MVIAIVLIWA SSQVAERLEM 
TIVSWLLDNF VNYFIIIIIV LFQQDIRRAL MRIGQNVVPF GKTHQLSHAL DEVLAAAQHL
ARARLGGIVV FEREADVRQF VDAGRVVDAQ VSKELLVALF VPSRDNELHD GAVIIGKSMR
IDRAGILLPL SRNNTLSQEY GTRHRAALGI TEETDAVVLV ISEERGEISL CFRGNIARDL
EIDTLRERLQ ALFDAEATDK IASEAEAAAS IGRAMAALAA TESAGSDDTK PPPSRSTSRV
PVTPVATASE RTERHVQPTD RLGVVERAAS NSDTSSQNAG KPSAVPSAPR AIPGEGSEVQ
TSRMSQPPAA ARPQRAAAAA ASTPAAGGGA AAPERFAKKS AELVPTSDAA STGSLSTERF
AKKSADVVPT SDKYAKTSEK IAPTSERLGK TTSERSDKHV QPGESRDRRT GEHVLPERNP
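
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG is one of the permitted initiation codons and is read as methionine when it starts a coding sequence. The sketch below illustrates this on the first four codons of the gene only. The codon dictionary is deliberately partial, and the function name translate_cds and the START_CODONS subset are illustrative, not part of any database tool.

```python
# Only the codons needed for this illustration; not a full genetic code table.
CODON_TABLE = {"GTG": "V", "CTA": "L", "GAG": "E", "GAC": "D", "TGA": "*"}

# A subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading an initiation codon as M."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            # The first codon initiates translation, so it yields methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon: end of the protein
        protein.append(amino_acid)
    return "".join(protein)


# First 12 bases of the gene sequence, with spaces removed.
first_codons = "GTGCTAGAGGAC"
print(translate_cds(first_codons))  # MLED
```

The result matches the first four residues of the protein sequence. Translating the full gene would need a complete codon table, for example from an established bioinformatics library.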