Gene Jann_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2039 
Symbol 
ID3934492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2044603 
End bp2045520 
Gene Length918 bp 
Protein Length305 aa 
Translation table11 
GC content61% 
IMG OID637904395 
Productfumarylacetoacetate (FAA) hydrolase 
Protein accessionYP_509981 
Protein GI89054530 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.630421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.541544 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCG GTACGATCCA CTTTCAGGGC CGGGACAGGT TGGTGGCCCG CCTGGACGAT 
GACCAGGTTC TCGACCTGCT GGAGGCCTCA GGCGGCGACG TCCATTTCGT CAACACGGTC
GCGATGATTG AAGCGGGACC CGCCGCATTG GACAAAGCGC GTGCGGTGCT GGAGGATCCC
CCGACGCGCG CAGTGATCGC CCTGTCGCAG GCCGAGCACC GCGTGCCGCT GCGCCCCGTG
CAGTATCGCG ATTGCCTTGT GTTTGAGCAG CATCTGATCA ACGGGTTCAA GCAGGCCGAG
AAGATGACGG GGCGCCCCTT CGCCATTCCG CCGGTCTGGT ACGAGCAGCC GATCTACTAC
AAGGGCAATC GGATGTCCTT CATTGGCCAC GGCCAGACGG TGCGATGGCC CGCCTACTCA
GACTATCTGG ATCTGGAGCT GGAACTGGCG ATCATCATCG GGAAGGAGGG CGCTGATATC
CCGCGCGAGA CGGCCCATGA GCATATCTGG GGCTATACCA TCCTTAATGA TGTTTCCGCC
CGGGATGCGC AGATGCGGGA GATGGCGGGA CAGCTGGGGC CTGCAAAGGG CAAGGATTTT
GATACAGGCA ATATCCTCGG GCCTTGGATC GTGACCGCCG ATGAAGTGTC GCATCCGGCC
GTTTTGAACA TGGATGTGAG CGTGAACGGT GAGCGGTGGG GTGGTGGCAC GTCCGCCGAT
ATGCAGTTCG ATTTCGCGCA GATCATCGCA CATATCTCCG CGTCTGAGCG GCTATTTCCC
GGTGAAGTGA TCGGCTCCGG CACGGTCGGC ACTGGCTGTG GGCTGGAGAT CGGCAAGCGG
CTCAGCGACG GCGATATGAT GGATTTGACG ATTGAGAAGA TCGGAACCCT GACCAACACA
ATCAAGAAAG GGGCCTGA
 
Protein sequence
MKIGTIHFQG RDRLVARLDD DQVLDLLEAS GGDVHFVNTV AMIEAGPAAL DKARAVLEDP 
PTRAVIALSQ AEHRVPLRPV QYRDCLVFEQ HLINGFKQAE KMTGRPFAIP PVWYEQPIYY
KGNRMSFIGH GQTVRWPAYS DYLDLELELA IIIGKEGADI PRETAHEHIW GYTILNDVSA
RDAQMREMAG QLGPAKGKDF DTGNILGPWI VTADEVSHPA VLNMDVSVNG ERWGGGTSAD
MQFDFAQIIA HISASERLFP GEVIGSGTVG TGCGLEIGKR LSDGDMMDLT IEKIGTLTNT
IKKGA