Gene Hoch_3651 details

Gene Information

Locus tag: Hoch_3651
Symbol:
ID: 8546041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5025794
End bp: 5028037
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 74%
IMG OID: 646388320
Product: Rhomboid family protein
Protein accession: YP_003268046
Protein GI: 262196837
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID:
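
The start, end, and length fields in this record are consistent with one another, which can be checked with a short sketch. It assumes 1-based, inclusive coordinates (the convention that reproduces the listed 2244 bp) and an unspliced CDS ending in a single stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp, end_bp):
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS that ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(5025794, 5028037)
print(length_bp)                  # 2244
print(protein_length(length_bp))  # 747
```

Both values match the Gene Length and Protein Length fields above.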


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.015996
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGTCG ATTTCCTCCT GCTCTCGGTG GTCGCGGTCA CCGCCTACCT GGTGCCCGTG 
GTCTGGCGAC GCCGGCCGCC GGGACGCCGG GGCTTTGGCT GGCTGCTGCT GGCCGATGGC
GGCGCCGCCC TGCTGGCGCT GCTCGAGCCG CCGGGGATCG ACGCCGAGCT GATCGGTTTC
ATCGCCATCG CCGCGGCCGT GTTCCTGCTG GTGATTCCGC CGCTGCTGCG CGATCTCGTG
CACCGCGCGC TGCGCGCCGA CAAACCCCGC CTGGCCCTGC ATCTGCTGCA GCTCTGGGAG
CTTCTGCAGC CCGGCATGCT GTCGGGCGAG GAGCGCGAGA TGATCGCCAT CCGCGTGGCC
GCGCGCGACG GCAAGATCGA CGCCACCGCG GGCGAGATGC GCGCCCTGCG CGACCGCATC
CCCGATCCCC ACGGCCGTGT GCACATGGAT CACCTGATCG TGTTCCTGTA CCTGAGCGCG
CGGCGCTGGA GCGACGCGGT CCAGGCCTAC GAGCGCCGGC TCGAGGACGA GCCGCTGTCG
CCGCAGACCT GCACCGAGCT GATCTGGGCG TACTGCGAGC TCGGCGACCT GCGCGCGGCC
GCGGCCCTGG TCGAGCGCAT CGCGCAGGCG GCCGAGGCCG GCGAGCCCAT GTGGGGCTAT
CTGGCGCAGC GGGCGCGGCT GATGTTTCTG GCCTTTGCCG GCCAGCGCGA GGCGCTCGAC
GAGCTGTTCG CCGAGGACGG CGCGCTGCGC GCCCTGCCCC GCGCCAGCCA GGCCTTCTGG
TCGGGCGTGG CGCATCTGCG CGCCGGCGAG ACCGAGGCCG CGCGCCAGCT CCTCAAGCGC
GCCTCGCGCT GGACCCGCGG CGACGTGCGC GCCCGCGAGC TGGCGCGGCA GATGCTCGAT
CGGCTCGAGT CGGGCGACGC CGAGCCGGCC GCGATCGCGG CCGTGGAGCT CGACGACGAG
CTGCGCACCG CGGCCGAGGA TTTCCGCCGC GCGGCGCTGG CCGCGCCCGC CCTGCCCCAG
CGCACGCCCC AGGTCGGACG CGTGTCCCTG CGCCAGGTGC CCGTGAGCGC GGCGCTGGTG
CTGGCCAATC TGGCCGCCTT TGCCGCGGTC TACTGGATCT TCGACAGCAC CAGCGACACC
GGCGCCCTGG TGCGTGCGGG CGCCAACGTC AAAGCCTGGG TCACCGAGCG CGGCCAGCTC
TGGCGCCTGC CGACCTCGAT GTTCTTGCAC GTCGGCCTGC TTCACCTGCT GCTCAACGTC
TACGGCCTGT GGATGCTGGG CAAGCTGGTC GAGCAGACCC TGGGCTCGGT GCGCAGCTTT
GGCCTGTACA TGCTGTCCGG CCTGGTCGGC GCCTGGGCCA GCGCGCGCTT TGGCGCCGGC
GGCATCTCGG CCGGGGCCTC GGGCGCTGTG CTCGGCCTGC TCGGCGCGCT GATCGCCGAG
CTGGTCGTCC ACCACCGCGC CTACCCGCGC CACTTCCGCT CGGCCCTGCT CACCCCGCTG
GTGTTCGTGG CCGCGGCCCA GGTCGGCATC GGCTTCTTCT ATCCGGTCAT CGATCAATGG
GCGCACGTGG CCGGCCTGGC CACCGGCGCG TTTGCGGCCA TGGTGCTGTC GCCGCAGTCG
CGGACGCCGC GCGGCGTCTC GCTGGCGCTC GGCGTCCTGC TGAGCGCGCT CGGCGTGGGC
AGCATCGCGC TCGGCGCCCA CGGGGTCGCG ACCACGCGCT ACCGCAGCTT CTTCAACAGC
GCCTGGGAGA CGCAGACCCT GGGCGGCCTG GCGTTTCGCG CGCCGGCCGG GCTGCGACGC
GACTCGGGCG TGCTCATCGA CGGCGTCATG CTGTGGCTGG AAGCGACCAC CTGTCCGAAC
TCGCCGAGCG GCCCGGTGCA CAGCTTGTGC GCGCCGGAGG GCGGCGATGT CGGCGCCCAG
CTCGACCACG CCGAGGCCGT GCTGCGCCGC TCGTGGGGCG AGGCGACGCT GCGCGAGGTC
GAGGCCGAGA GCGGGCCGGC GCCATGGCAG CGGCGCGCCC TGCACGCCAC CGACATCGGC
ATCGATGGCG CGGATCGCTA CCGCATCATG CTGAGCGCGC GCGCCACGCC GAGCGCGACC
TGGCTGTTCA TCATCCAGAC CCCGAGCGCG CTGGCCGACG CGCTCGAGCC GAGCGTGGTC
GAGATGCTCG ACTCGGTGCG TGCGGCAGAT GCGCCCGCGC CCGCGCCCGC GCCCGCGCCC
GCTGCCGATA CCGCCGGCGA CTGA
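
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as a string in the same 10-base block layout; `gc_content` is a hypothetical helper, and how the database rounds its reported percentage is not stated in the record.

```python
def gc_content(dna):
    """Percentage of G and C bases, ignoring the spaces and newlines
    used in the block-formatted sequence listing."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First block of the gene sequence: 5 of its 10 bases are G or C.
print(gc_content("ATGTACGTCG"))  # 50.0
```

Applying the function to the full 2244 bp sequence would give the value to compare against the reported GC content.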
 
Protein sequence
MYVDFLLLSV VAVTAYLVPV VWRRRPPGRR GFGWLLLADG GAALLALLEP PGIDAELIGF 
IAIAAAVFLL VIPPLLRDLV HRALRADKPR LALHLLQLWE LLQPGMLSGE EREMIAIRVA
ARDGKIDATA GEMRALRDRI PDPHGRVHMD HLIVFLYLSA RRWSDAVQAY ERRLEDEPLS
PQTCTELIWA YCELGDLRAA AALVERIAQA AEAGEPMWGY LAQRARLMFL AFAGQREALD
ELFAEDGALR ALPRASQAFW SGVAHLRAGE TEAARQLLKR ASRWTRGDVR ARELARQMLD
RLESGDAEPA AIAAVELDDE LRTAAEDFRR AALAAPALPQ RTPQVGRVSL RQVPVSAALV
LANLAAFAAV YWIFDSTSDT GALVRAGANV KAWVTERGQL WRLPTSMFLH VGLLHLLLNV
YGLWMLGKLV EQTLGSVRSF GLYMLSGLVG AWASARFGAG GISAGASGAV LGLLGALIAE
LVVHHRAYPR HFRSALLTPL VFVAAAQVGI GFFYPVIDQW AHVAGLATGA FAAMVLSPQS
RTPRGVSLAL GVLLSALGVG SIALGAHGVA TTRYRSFFNS AWETQTLGGL AFRAPAGLRR
DSGVLIDGVM LWLEATTCPN SPSGPVHSLC APEGGDVGAQ LDHAEAVLRR SWGEATLREV
EAESGPAPWQ RRALHATDIG IDGADRYRIM LSARATPSAT WLFIIQTPSA LADALEPSVV
EMLDSVRAAD APAPAPAPAP AADTAGD
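
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. Below is a minimal sketch of that translation, under stated assumptions: NCBI translation table 11 uses the same codon-to-residue assignments as the standard code and differs only in which start codons it permits, so the standard assignments are used here; alternative start codons are not handled, which does not matter for this gene because it begins with ATG. `translate_cds` is an illustrative name.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a CDS up to (not including) its first stop codon.
    Whitespace from the block-formatted listing is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate_cds("ATGTACGTCG ATTTCCTCCT GCTCTCGGTG"))  # MYVDFLLLSV
# Last 24 bases, ending in the TGA stop codon.
print(translate_cds("GCTGCCGATA CCGCCGGCGA CTGA"))        # AADTAGD
```

The two fragments reproduce the first and last residues of the protein sequence listed above.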