Gene Hoch_4033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4033 
Symbol 
ID8546434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5537757 
End bp5538881 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content70% 
IMG OID646388710 
ProductOmpA/MotB domain protein 
Protein accessionYP_003268425 
Protein GI262197216 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.659004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.277798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGG CCCTGCGCGC GGACGCGCGG ACGACTTCGC GGCTGCCGCG GCCGACGCGC 
GCGCTGGCGG CGATGGTGCT GGCGGCGCTC ATGGCGCTGG CCGGCGGCTG CGGCGGTGGC
GCCGAGCTGC GCGGCCGCAG CCAGACGGTG GACTCGCTGG TGGCCACGGC GCGCGAGAAC
GGCGCCGAGC GTTGCGCGCC GGTAGAGCTG GCGCTGGCCG AGAGCCACGT GGCCTTTGCC
AAGCAAGACC TCGACGAAGG CGATCCATTC CGCGCCCGCC GCGAGCTGGA GATCGCCGAG
AGCAACGCCC GCGAGGCGCT GCGCCTGTCG CCCAAGGATC CCTGCGTGCC GCCCGAGGTG
GTCGCCGTGG ACAGCGACGG CGACGGCATC TTCGACGATC GCGACGCGTG CAAGGGCGCG
CCCGAGGACA AGGACGGCTT CGAGGACGAG GACGGCTGCC CCGACCTCGA CAACGATCAG
GACGGCATCG TCGACGCCAG CGACGCCTGC CCGCTCGAGC CCGAGGACAA GGACGGGCTC
GACGACGAGG ACGGCTGCCC CGAAGAGGAC CGCGACGGCG ACATGATCGC CGACAACAAG
GACCAGTGCC CGGACGAGCC CGAGGACAAG GACGGCTTCG CCGACGAGGA CGGCTGCCCC
GACTGCGACA ATGACGGCGA CGGCGTGCCC GAGTGCCCGG TGGTCGTGGA CCAGTGTCCG
AGCAAGGCGG CCAAGACCCC GGACGGCTGT CCGGTTTACA ATCTGGTCAA GGTCACCTCG
AAGAAGATCG AGATCAAGCA GACCATCTAC TTCGAGACCG GCAAGAACAC CATCAAGCCG
GTGTCCTTTG CGCTGCTCAA CGAGGTGGCC ACGGTGCTCA CCGACAACCC CGAGATCGAG
GTGCGCATCG AGGGTCACAC CGACAGCCGC GGCAGCGCCG AGTTCAACAT GGAGCTGAGC
CAGAGCCGGG CCGAGTCGGT GCGCAGCTTC CTCATCGACA AGGGCGTGGA CGGCGACCGC
CTCGAGGCCA AGGGCTACGG CGAGAGCGCG CCCATCGCCA ACAACAACAC CCGCGCCGGC
CAGGCCCAGA ACCGGCGCGT GGAGTTCGTG ATCGTCAGCC GTTAG
 
Protein sequence
MSQALRADAR TTSRLPRPTR ALAAMVLAAL MALAGGCGGG AELRGRSQTV DSLVATAREN 
GAERCAPVEL ALAESHVAFA KQDLDEGDPF RARRELEIAE SNAREALRLS PKDPCVPPEV
VAVDSDGDGI FDDRDACKGA PEDKDGFEDE DGCPDLDNDQ DGIVDASDAC PLEPEDKDGL
DDEDGCPEED RDGDMIADNK DQCPDEPEDK DGFADEDGCP DCDNDGDGVP ECPVVVDQCP
SKAAKTPDGC PVYNLVKVTS KKIEIKQTIY FETGKNTIKP VSFALLNEVA TVLTDNPEIE
VRIEGHTDSR GSAEFNMELS QSRAESVRSF LIDKGVDGDR LEAKGYGESA PIANNNTRAG
QAQNRRVEFV IVSR