Gene Spro_2096 details

Gene Information

Locus tag: Spro_2096
Symbol:
ID: 5606473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 2290413
End bp: 2291675
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 60%
IMG OID: 640937634
Product: protocatechuate 4,5-dioxygenase
Protein accession: YP_001478327
Protein GI: 157370338
COG category: [S] Function unknown
COG ID: [COG3384] Uncharacterized conserved protein
TIGRFAM ID:
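
The coordinates and lengths in this record are mutually consistent, which a few lines of Python can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Spro_2096.
length_bp = gene_length(2290413, 2291675)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1263 420
```

Both results match the Gene Length and Protein Length fields.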


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.624399
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAGA TCATTGGCGG ATTGGCGGTG TCACACACCC CAACCATCGG TTTTGCAGTG 
GATCACAACA AGCAGAACGA AACGGCCTGG GCGCCCATTT TTGACGGTTT CGCCCCGATG
CAGCAATGGC TGGAAGAGAA AAAGCCGGAC GTGTTGCTGT ATGTCTTCAA CGACCACGTG
ACCTCGTTTT TCTTCGATCA CTATTCGGCA TTCGTGCTCG GCATTGACGA CAGCTACGCG
GTGGCGGACG AAGGTGGCGG CCCGCGTGAT TTGCCGCCGA TTCGCGGCCA TGCGGCGCTG
TCACAGCACA TCGGTGCCAG CCTGATGGCC GACGAGTTTG ATATGTCGTT CTTCCAGGAT
AAGCCGCTCG ATCACGGGCT GTTCTCTCCG CTCTCCGCCC TGCTGCCGTG GCAGAATGGC
TGGCCGATGC AGGTGGTGCC GCTGCAGGTC GGGGTGCTGC AGTTCCCGAT CCCTTCGGCT
CGCCGCTGCT ACAAGCTCGG TCAGGCGCTG CGCCGGGCAA TTGAAAGCTT CCCGGAAGAC
TTGCGCGTCG CGGTGGTGGC CACCGGCGGC GTCTCGCATC AGGTGCATGG CGAACGTTGC
GGTTTTAATA ATCCGCAGTG GGATGAGCAG TTTGTCGACC TGCTGGTCAA TGACCCGGAG
CGCCTGACCG AAATCACGCT GGCGGAGTAC GCCACCTTGG GTGGGCTGGA GGGCGCGGAG
GTGATCATGT GGCTGATTAT GCGCGGCGCC CTGTCGGCCA ACGTCGAAAA ACTGCATCAG
GCCTATTACC TGCCGTCCAT GACCGGTATC GCCACGCTGA TCCTGGAAAA CCAGGCGCGC
GAGGCACCGG TAGATGTCCA TCAGCGCCAG CGCGACAAAA TCAACCTGCA ACTGGCGGGG
GTTGAGAAAC TGCCGGGCAC CTATCCCTTT ACCCATGCGC GCAGCCTGAA AGCCATCCGC
ATCAACCGTT TCCTGCACAA ACTGATCCAG CCGGCCTGGC GCGAACGCTT CAATAACGCC
CAGCAGGCGC TGTTCGACGA AGCGCAGCTC ACCACTGAGG AACAGCAGCT GCTGCGCGAG
CTGGACTGGC GCGGGCTGAT CCATTACGGC GTCAGTTTCT TCCTGTTGGA AAAGCTCGGG
GCAGTGGTCG GGGTATCCAA CCTGCATATC TACTCGGCGA TGCGTGGCCA GACGCTGGAT
GAGTTCCAGC AAACCCGCAA TCAGCAAGTG TTGTATTCCG TTGCGGGGAA AGCGCCAAAA
TGA
 
Protein sequence
MAKIIGGLAV SHTPTIGFAV DHNKQNETAW APIFDGFAPM QQWLEEKKPD VLLYVFNDHV 
TSFFFDHYSA FVLGIDDSYA VADEGGGPRD LPPIRGHAAL SQHIGASLMA DEFDMSFFQD
KPLDHGLFSP LSALLPWQNG WPMQVVPLQV GVLQFPIPSA RRCYKLGQAL RRAIESFPED
LRVAVVATGG VSHQVHGERC GFNNPQWDEQ FVDLLVNDPE RLTEITLAEY ATLGGLEGAE
VIMWLIMRGA LSANVEKLHQ AYYLPSMTGI ATLILENQAR EAPVDVHQRQ RDKINLQLAG
VEKLPGTYPF THARSLKAIR INRFLHKLIQ PAWRERFNNA QQALFDEAQL TTEEQQLLRE
LDWRGLIHYG VSFFLLEKLG AVVGVSNLHI YSAMRGQTLD EFQQTRNQQV LYSVAGKAPK
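
The sequences above are printed in blocks of ten characters, so they have to be joined before any analysis. The following is a minimal sketch using only the Python standard library, with hypothetical helper names. It strips the whitespace, computes the GC fraction, and translates using the standard codon assignments. Translation table 11 shares these assignments with table 1; the two differ in their permitted start codons, which this sketch does not model.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_block.split()).upper()


def gc_content(seq_block: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq_block)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_block: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq_block)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCTAAGA TCATTGGCGG ATTGGCGGTG TCACACACCC CAACCATCGG TTTTGCAGTG"
print(translate(first_line))  # MAKIIGGLAVSHTPTIGFAV
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.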