Gene Jann_4161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_4161 
Symbol 
ID3936650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp4271607 
End bp4272776 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content61% 
IMG OID637906547 
Product4-hydroxybenzoate 3-monooxygenase 
Protein accessionYP_512103 
Protein GI89056652 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR02360] 4-hydroxybenzoate 3-monooxygenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.314904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCC AAGTTGCGAT CATCGGCGGC GGGCCGTCCG GCCTGTTGCT GGCCCAACTT 
CTGCACAGAC GCGGCATCGA TAGCATCGTG TTGGAGCGCA AGACCAAAGA CTACGTCCTG
GGCCGGATTC GCGCGGGCGT GCTGGAACAA GGGCTTGTCG GACTGCTGGA ACAGGCGGGC
TGCGCGGACC GCCTGCACGC AGAAGGGTTT ACCCACGACG GCACGCTGAT TTCCTACGGA
GATCAGATGT TTCGCGTGGA CTTCACCGAA CACGTGGGTC AGCCGGTGGT CGTCTATGGC
CAAACGGAAG TGACCCAGGA CCTTTACGCC GCCCGGGAGG CGTCGGGCGG GCAGATCGTC
TACAACGTCG ACGATGTGGA GATCCACGAT GCCAAGAGCG ACACGCCCTT TGTTACCTAT
CACGTAGACG GTCACGCTAA GCGCATCGAT TGTGACTTCA TTGCCGGGTG TGACGGCTTC
CACGGGATTA GCCGCAAGAC CATCCCCGAA GACGCGCGGC GGGAGTTTGA GAAGATCTAT
CCGTTCGGCT GGCTCGGCAT CCTGTCGGAA ACGCCGCCTG TGAACCACGA GTTGATCTAC
GCCAACCATC CACGCGGCTT CGCGCTGTGT TCCATGCGGA ACGCGCAGTT GAGCCGCTAT
TACATCCAAT GCTCTCTCGA TGATCATCCC GACAATTGGA GCGATCAGGC GTTTTGGGAG
GAGTTGAAGC GTCGCATCCC ACCCGCGCAG GCGGATGCGC TTGTGACGGG CCCCAGTATC
GAAAAATCCA TCGCGCCGCT GCGGTCCTTC GTGACCGAAC CGATGCGGTG GGGGCGGCTG
TTCCTGTGCG GCGACGCGGC CCATATCGTG CCACCCACCG GGGCGAAAGG GCTCAACACC
GCCGCGTCGG ACGTCCATTA TCTGTTCGAA GGGTTGAAGG CCTTCTACGC CGATGGGTCC
GACGAGGGCA TCGACGCCTA TTCCGAAAAA GCGCTCGCGC GGGTGTGGAA GGCCGAAAGG
TTCTCATGGT GGTTCACGAC GATGATGCAC CGCTTCCCCG ATCAGACCGC GTTTGACCTG
AAAATGCAGG TGGCCGATCT GGAATTCCTG CGCGGCTCCG CAAGTGCCCA GAAGGCCATG
GCCGAAAACT ACGTGGGCTT GCCCTATTGA
 
Protein sequence
MKTQVAIIGG GPSGLLLAQL LHRRGIDSIV LERKTKDYVL GRIRAGVLEQ GLVGLLEQAG 
CADRLHAEGF THDGTLISYG DQMFRVDFTE HVGQPVVVYG QTEVTQDLYA AREASGGQIV
YNVDDVEIHD AKSDTPFVTY HVDGHAKRID CDFIAGCDGF HGISRKTIPE DARREFEKIY
PFGWLGILSE TPPVNHELIY ANHPRGFALC SMRNAQLSRY YIQCSLDDHP DNWSDQAFWE
ELKRRIPPAQ ADALVTGPSI EKSIAPLRSF VTEPMRWGRL FLCGDAAHIV PPTGAKGLNT
AASDVHYLFE GLKAFYADGS DEGIDAYSEK ALARVWKAER FSWWFTTMMH RFPDQTAFDL
KMQVADLEFL RGSASAQKAM AENYVGLPY