Gene Shew185_2653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew185_2653 
Symbol 
ID5371968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS185 
KingdomBacteria 
Replicon accessionNC_009665 
Strand
Start bp3160394 
End bp3161554 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content48% 
IMG OID640830877 
Producthomogentisate 12-dioxygenase 
Protein accessionYP_001366851 
Protein GI153001170 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.51516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTTT ATGTGAAACA AGGCCAGATC CCCCATAAGC GCCATATCGC ATTTGAAAAG 
GAGAACGGCG AGCTTTACCG TGAAGAGTTG TTCTCAACCC ATGGTTTTTC CAATATTTAT
TCCAATAAAT ATCACCACAA CATGCCAACC AAAGCGTTGG AAGTGGCTCC TTACAGCCTA
GGCCACGGTG CCAATTGGGA AGACTCGCTG GTCCAAAACT ATAAACTCGA TTCCCGCGAT
GCCGACCGTG AAGGTAATTT TTTCAGCGCC CGTAACAAGA TTTTTTATAA CAATGACTTA
GCACTTTATA CCGCCAAAGT CACCGCCGAT ACCGACGAGT TTTACCGTAA TGCCTATGCC
GATGAGGTGC TTTTTGTCCA CGAAGGCGAA GGCACACTCT ACAGCGAGTA CGGCACGATT
AAGGTCCAAA AGTGGGACTA TCTCGTGATC CCACGCGGCA CGACTTATCA ACTTAAATTC
AACGATTACA GCAATGTGCG ACTTTTCGTT ATCGAGTCAT TCTCTATGGT GGAAGTGCCT
AAGCATTTCC GTAACGAGTA TGGTCAGCTA TTAGAATCGG CACCTTACTG CGAGCGCGAT
TTGCGCGTGC CCACATTGCA AGATGCTGTG GTTGAGCGCG GCGCCTTCCC CTTAATATGT
AAGTTTGGCG ATAAGTACCA ACTCACCACA CTAGAATGGC ATCCCTTTGA TTTAGTGGGT
TGGGATGGCT GTGCTTACCC TTGGGCATTC AACATTACTG AATACGCGCC AAAGGTGGGC
AAAATCCATC TGCCGCCTTC TGATCACTTA GTCTTTACCG CCCATAACTT TGTGATTTGT
AACTTCGTGC CGCGTCCCTA CGATTTCCAC CCGAAATCGA TTCCGGCGCC TTATTACCAC
AACAATATCG ATAGCGACGA AGTCTTGTAT TACGTCGATG GCGACTTTAT GAGCCGCACG
GGCATTGAAG CGGGTTATAT GACCTTACAT CAAAAAGGTG TGGCCCACGG CCCACAACCG
GGCCGCACTG AAGCCTCGAT TGGCAAGAAA GAAACCTATG AATACGCAGT GATGGTCGAC
ACCTTCGCCC CACTTAAATT AACCGAACAT GTACAAAATT GCATGAGCAA AGACTACAAC
CGCTCTTGGC TAGAAGACTA G
 
Protein sequence
MPFYVKQGQI PHKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYSL 
GHGANWEDSL VQNYKLDSRD ADREGNFFSA RNKIFYNNDL ALYTAKVTAD TDEFYRNAYA
DEVLFVHEGE GTLYSEYGTI KVQKWDYLVI PRGTTYQLKF NDYSNVRLFV IESFSMVEVP
KHFRNEYGQL LESAPYCERD LRVPTLQDAV VERGAFPLIC KFGDKYQLTT LEWHPFDLVG
WDGCAYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVIC NFVPRPYDFH PKSIPAPYYH
NNIDSDEVLY YVDGDFMSRT GIEAGYMTLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD
TFAPLKLTEH VQNCMSKDYN RSWLED