Gene Hoch_5086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5086 
Symbol 
ID8547497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7011614 
End bp7013215 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content67% 
IMG OID646389762 
ProductOmpA/MotB domain protein 
Protein accessionYP_003269467 
Protein GI262198258 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGG TCAAACTCAA ACCCGCGGCG AAGATCCTCA TCGTCATCAT CGTCCTGGCG 
GCCGGTTTCG CGCTGTACAA CTTCGTGCTC AAGGACATGC TGGCCAAAGG CGGCGGCGGC
GGCGACGGCG GCGGCGACGA AGTGGCCAAC GGCGACGGCG ACACCGGCAA CACCGGCAAC
ACCGGCAAGA AGGACGTGTA CCGCGTGGCG CTGTCCGAGT GGCCCGGGCA CATGCCCATG
GTGATCGGCA ACGGCGGCCT CAAGACCCAG GCCGGCTCGG CCGCCGACCG CGAGGGCATC
AAGCTCGAGA TCGTGTTCAT CGAGGACCCG ATCAAGAAGA ACGCGGCGCT GCAGAACGGC
TCGGTGGACT TCGTCTGGCA GGTGGTCGAC GAGATGCCCA TCAACATGGG CGGCTACAAG
CAGTCGGGCG TCGAGCCGCG CGCCTTCCTG CAGCTCGACT GGTCGCGCGG CGGCGACGCC
TGCGTGGCCT CCAAGGAGAT CGAGACGGTC GAGGACATCC TCGGCCACAA GTCGGCGATG
ATGATGTTCT CGCCCGATCA CACGGTGTTC GAGTTCATGA TCAACAACTC GCGGCTCACG
CCCGATCAGA TCGCCCAGGT GCGCCAGGAC ACCAGCTTCT CCATGGACGA CTTCACCTAC
GGCCGCGTGC TCTTCGTGCA GAACAAGGTC GATGTCGCCT GTCTGTGGGA GCCCGACGTC
ACCCTGGCGC TCGAGGGTCG CCCCGGCGCC CATCGCCTGT TCTCCACGGC TGACGCCACC
GAGCTGATCG CCGACGTGCT GCTGACCCGC CAGGACACCC TGGCCAGCAA TCAAGACGTC
GCCGAGAAGG TGGCCAAGGT GTGGTTCGCC GGCGTCGAGC AGGCCGAGAG CGACCGCCAG
GCCGCGGCCC GCTTGATCGC GCAGGTGGTG CCGCGCTTCC GCGACGAGCT GGGCGTGGAC
GGCACCCTGG GCGCCTTCGA GTGGGTGAAG TGGACCAATC TGTCCGACAA CGCGCGCTTC
TTCGGCGTCT CCGGCGGCCA GGTGGCCTTC GACCGGGTGT ACAACCAAGC CGACGGCGTG
TGGACCCAGT ATCCCAAGGC CGAGATCACC GACCGCTTCG CGCCCAGCGC GCTGCGCAAC
GACAAGATCG TCGCCGAGCT GTGGGAGAAC CGCCCCGACG ACGAGCCGGT CGCGGCCACC
ACCCGGCCCG AGCCCGAGTA CAAGCCCGAG GTCGCCGACA CCGGCCGCGC GGTGTTCACC
AAGCCGGTGA CCATCAACTT CGACACCGGC CAGAGCGCGC TCGACCCCGA GTCGATGCAC
CTGCTCAACA CCCAGGTGCT GCCGCAGCTC GAGATGGCCG GCGGCATGTA CGTGCGCATC
GAGGGCAACA CCGACAACGT CGGCGACAAG CGCGGCAACC AGGCGCTGAG CGAGGCCCGC
GCGCAGTCGG TTCTCGACTA CCTGGTCCAA AAAGGTATCA ACGCCAAGCG CCTGTCGGCC
AGGGGCAACG GCTCCGACAG CCCGGTGGCC AGCAACAAGA CCGCGGACGG GCGGGCGGCC
AACCGCCGCA CCGACATCGT CTTCATCAGC GGCCAGGAGT AG
 
Protein sequence
MAKVKLKPAA KILIVIIVLA AGFALYNFVL KDMLAKGGGG GDGGGDEVAN GDGDTGNTGN 
TGKKDVYRVA LSEWPGHMPM VIGNGGLKTQ AGSAADREGI KLEIVFIEDP IKKNAALQNG
SVDFVWQVVD EMPINMGGYK QSGVEPRAFL QLDWSRGGDA CVASKEIETV EDILGHKSAM
MMFSPDHTVF EFMINNSRLT PDQIAQVRQD TSFSMDDFTY GRVLFVQNKV DVACLWEPDV
TLALEGRPGA HRLFSTADAT ELIADVLLTR QDTLASNQDV AEKVAKVWFA GVEQAESDRQ
AAARLIAQVV PRFRDELGVD GTLGAFEWVK WTNLSDNARF FGVSGGQVAF DRVYNQADGV
WTQYPKAEIT DRFAPSALRN DKIVAELWEN RPDDEPVAAT TRPEPEYKPE VADTGRAVFT
KPVTINFDTG QSALDPESMH LLNTQVLPQL EMAGGMYVRI EGNTDNVGDK RGNQALSEAR
AQSVLDYLVQ KGINAKRLSA RGNGSDSPVA SNKTADGRAA NRRTDIVFIS GQE