Gene Hoch_2788 details

Gene Information

Locus tag: Hoch_2788
Symbol:
ID: 8545176
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3824265
End bp: 3825734
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 46%
IMG OID: 646387480
Product: SEFIR domain protein
Protein accession: YP_003267208
Protein GI: 262195999
COG category:
COG ID:
TIGRFAM ID:
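
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete with a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3824265
end_bp = 3825734

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS ending in one stop codon,
# so the protein has one residue fewer than the codon count.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.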


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0246678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGATG CGATTGAGAG CGATTCAGAA GTTCCTAAGC TGTTTATCTC TTACAGTTGG 
ACGAGTGAAG AGCATCAGAA CTGGGTGTTG CAGCTAGCTA ACGACTTGGC CGATAACGGG
GTTCATGTCG TGATCGATAA ATTTGATCTT CGTCCTGGTC ACGATGCGTT TAAGTTTATG
GAGCAAATGG TCACAGATCC TTCAATTTCT AAAGTGCTCA TTGTTTCTGA TGAGGCATAT
GCAACCAAAT CAGATGAGCG CATGGGTGGC GTTGGGATCG AGGCCCAAAT TATAACGCCG
GATGTTTACA AACAGGCCCA GCAGTCAAAA TTTGTAGTGC TAGTGCGTGA ACTTAAACCA
GATGGCCAAC CATGGGTACC TGCATATTAC AATTCGAAGT TGTACATCGA CTGCTCTGAT
CTGAACGAGT ACGAAGAGAA ACTCGAAGAA CTCATTCGCT GGGCCTTCGA TAAGCCGTTG
CATCGGCGGC GTCCGCTCGG GAGGCCACCT GCTTATCTAT TTGAGACGCC CAGTGTCGAC
CTAGGTACTC GGTCAGCTCA ACAACGTTGT GTCAAAGCAA TCCGAAGTGG TGCTCAATCA
GCATCCGGCG CACTAGATGA ATACATCAAC ATCTTTGTCG ACAATTTGGG GCGCTTTTTC
TCAGAAATAT TAGATGAAGA CGATAAGACG GACAGCTCGC CCTCAGAATC GTTCAATAGG
GCGTTGAGAC TATTTCTTCC CTATCGAGAT GAGATGCTTC AGGTGTTTAG TGCTGTGGCT
TCTTGTGCAA CGGTTTCAAA TAATCAGTAT TTGAATACTG CCATTAGATT GTTTGAGAAG
CTGGGCTGCT ACACTAGGGG CACCATGCGC AATCCACGGA ATCGTGCGGA TGTAGATTTG
TTTTCTTTTG TAGTGCATGA ACTATTCTTG TACCTGCTGG CTTTGTTTTT TGAGAAGGGG
CGGTTAGAGG ATATTGATTC AATATTGAGC CATCGATACT ATGTGAGGTG CGACTCTGGT
TCTGGTTATA CGGATGCGTA TTCTTATTCT ATTTTTTACT CGCCTGTCGA AATTCTTTAT
AATCCAAGAG CTTGGGTGGT AGAGGGAGCG CATGCCGCAT TACTTCGCCA GCGTAGCTAT
TCCGGGATAG AATTCAGGTA TGTTATGGAG ATGGATTTTG TTGCTCACCT GAGGTCAGAA
ATCGCGTCCG AGACAAGTGG CGGATGGCAT CCAGTAACTT TGTGCCTCAA AGAGGGTGTT
GAAACGCTGG AAGTATTCCA ACGATCGGTT TCTAAATCCT ACTTTGAACG AGTTCTGTTG
ATGCTCGGTG GCGTGTCGAA GGCTGACATC GAAGCCTACG TAGAGTCTGC TGTGGGCGGT
CGCACGGCGA GACAGATTCC TCTTCAAATA AAAACACCTA GGCATATACA GAATCTTTTT
GATTTAGAAA ATTTAGGTGT GATAAGATGA
 
Protein sequence
MADAIESDSE VPKLFISYSW TSEEHQNWVL QLANDLADNG VHVVIDKFDL RPGHDAFKFM 
EQMVTDPSIS KVLIVSDEAY ATKSDERMGG VGIEAQIITP DVYKQAQQSK FVVLVRELKP
DGQPWVPAYY NSKLYIDCSD LNEYEEKLEE LIRWAFDKPL HRRRPLGRPP AYLFETPSVD
LGTRSAQQRC VKAIRSGAQS ASGALDEYIN IFVDNLGRFF SEILDEDDKT DSSPSESFNR
ALRLFLPYRD EMLQVFSAVA SCATVSNNQY LNTAIRLFEK LGCYTRGTMR NPRNRADVDL
FSFVVHELFL YLLALFFEKG RLEDIDSILS HRYYVRCDSG SGYTDAYSYS IFYSPVEILY
NPRAWVVEGA HAALLRQRSY SGIEFRYVME MDFVAHLRSE IASETSGGWH PVTLCLKEGV
ETLEVFQRSV SKSYFERVLL MLGGVSKADI EAYVESAVGG RTARQIPLQI KTPRHIQNLF
DLENLGVIR
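
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and the code relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ only in permitted start codons). Only the first and last 30 bases of the gene sequence are used here.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last 30 bases of the gene sequence shown above.
first_30 = "ATGGCGGATG CGATTGAGAG CGATTCAGAA"
last_30 = "GATTTAGAAA ATTTAGGTGT GATAAGATGA"

print(translate(first_30))  # MADAIESDSE
print(translate(last_30))   # DLENLGVIR
```

The two results match the first ten and last nine residues of the protein sequence in the record. Passing the full 1470 bp sequence to the same function would be the natural next check.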