Gene Jann_1439 details

Gene Information

Locus tag: Jann_1439
Symbol:
ID: 3933886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1406329
End bp: 1407711
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 62%
IMG OID: 637903789
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_509381
Protein GI: 89053930
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
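
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The constant and function names are hypothetical, not part of the record.

```python
# Values copied from the gene record above.
START_BP = 1406329
END_BP = 1407711
GENE_LENGTH_BP = 1383
PROTEIN_LENGTH_AA = 460


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1383
print(expected_protein_length(GENE_LENGTH_BP))  # 460
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with an unspliced coding sequence.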


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAAC AAGCCACACG CAGACCGATG ATCCAGGCCC CCAGCCACCC CGGCCCCCAT 
GACGGCTATA TGCCGGGCTT CGGCAATGAC TTTGAGACAG AGGCGCTGCC GGGTGCGCTG
CCGCAGGGGA TGAACTCGCC CCAGAAGGTA AATTACGGCC TCTACGGCGA ACAGCTCTCC
GGCACCGCCT TCACCGATGT GCGGCCTGAG CGGACGTGGT GTTACCGCAT CCGTCCCTCC
GTCAAGCACT CACACCGTTA CTCCAAGATC GACCTGCCCT ACTGGCACTC CGCGCCGACC
ATTGACCCCG ATGTGATCTC ACTAGGTCAG TACCGCTGGG ACCCTGTTCC CCATTCCGAC
ACGCCGCTGA CCTGGCTCAC CGGCATGCGC ACCATGACCA GCGCGGGGGA TGTGAACACG
CAGGTCGGCA TGGCGACCCA TGTCTATCTG GTCACGCAAA GCATGGTGGA CGATTATTTC
TACTCCGCTG ACAGTGAGAT GCTGGTTGTC CCGCAGGAGG GCCGCCTGCG CTTCTGCACC
GAGCTTGGCA TCATCGACGT GGAGCCGCAG GAGATCGCCA TTCTGCCACG CGGTCTTGTG
TACCGGGTGG AGGTGCTGGA CGGCCCCGCG CGCGGCTTTG TTTGCGAAAA CTACGGCGCG
AAGTTTGACC TTCCGGGGCG CGGCCCCATT GGCGCGAACT GCATGGCCAA CCCGCGCGAC
TTCAAGGCCC CCGTCGCGGC CTATGAGGAC CGCGAAGTGC CGTCCACAAT CACGATCAAA
TGGTGCGGCC AGTTCCACAC GTCGAAGATC GCGCAGAGCC CGCTGGATGT CGTGGCCTGG
CACGGCAATT ACGCGCCCTA CAAATACGAT CTGAAGACCT ATTGCCCCGT CGGCGCGATC
CTGTTCGACC ACCCGGACCC GTCGATCTTC ACGGTGCTGA CCGCGCCATC GGGCCAACCG
GGCGTCGCCA ACATCGACTT CGTGCTGTTC CGCGAGCGCT GGATGGTGGC CGAAAACACG
TTCCGCCCGC CGTGGTATCA CAAGAACATC ATGTCCGAAC TGATGGGCAA CATCTACGGC
CAATATGACG CCAAGCCCAA GGGGTTCGTG CCCGGCGGTA TCTCCCTGCA CAACATGATG
ATCCCCCACG GCCCCGACAA AAACGCCTTC GAGGGCGCGT CAAACGCCGA CCTTCAGCCG
CAGAAGCTCG ATAACACAAT GTCCTTCATG TTCGAGACCC GCTTCCCCCA ACACCTCACG
GCCTTTGCGG CGAATGAGGC CCCGTTGCAG GACGACTACA TCGACTGCTG GGAAACGCTG
GAGAAGAAGT TTGATCCCTC GCAGCGCCCC GATGCGGGTC ACGGGACGCC GGGCAAAAAA
TGA
 
Protein sequence
MNEQATRRPM IQAPSHPGPH DGYMPGFGND FETEALPGAL PQGMNSPQKV NYGLYGEQLS 
GTAFTDVRPE RTWCYRIRPS VKHSHRYSKI DLPYWHSAPT IDPDVISLGQ YRWDPVPHSD
TPLTWLTGMR TMTSAGDVNT QVGMATHVYL VTQSMVDDYF YSADSEMLVV PQEGRLRFCT
ELGIIDVEPQ EIAILPRGLV YRVEVLDGPA RGFVCENYGA KFDLPGRGPI GANCMANPRD
FKAPVAAYED REVPSTITIK WCGQFHTSKI AQSPLDVVAW HGNYAPYKYD LKTYCPVGAI
LFDHPDPSIF TVLTAPSGQP GVANIDFVLF RERWMVAENT FRPPWYHKNI MSELMGNIYG
QYDAKPKGFV PGGISLHNMM IPHGPDKNAF EGASNADLQP QKLDNTMSFM FETRFPQHLT
AFAANEAPLQ DDYIDCWETL EKKFDPSQRP DAGHGTPGKK
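
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons), and it translates only the first and last lines of the gene sequence shown above. The helper names (`CODON_TABLE`, `clean`, `translate`) are hypothetical.

```python
# Compact standard codon table: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGAACGAAC AAGCCACACG CAGACCGATG ATCCAGGCCC CCAGCCACCC CGGCCCCCAT"
last_lines = "GAGAAGAAGT TTGATCCCTC GCAGCGCCCC GATGCGGGTC ACGGGACGCC GGGCAAAAAA TGA"

print(translate(first_line))  # MNEQATRRPMIQAPSHPGPH
print(translate(last_lines))  # EKKFDPSQRPDAGHGTPGKK*
```

The two outputs match the first and last 20 residues of the protein sequence, with the trailing `*` corresponding to the terminal TGA stop codon.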