Gene SO_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_0471 
Symbol 
ID1168344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp495533 
End bp496612 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content54% 
IMG OID637342468 
Productdioxygenase 
Protein accessionNP_716108 
Protein GI24372066 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGTGTC GACTCACGCG ATTATTTGGG ATTGAATTTC CGATTATTCA GGCACCTATG 
GCCGGAGTGC AGGGGAGTGC GCTCGCGATT GCCGTCTCTG AGGCGGGTGG TTTAGGTTCA
TTACCCTGTG CCATGTTATC CCTTGAGGCG CTCGAGGCTG AATTAACTGC AATACGCTCG
CAAACCGCTA AACCTATCAA TGTGAACTTT TTCTGCCATC GCGAGCCTGT AGCGCAGGCA
GCTAAACAAG CCGCTTGGCT TGAACAGTTA GCGCCCTATT TTGCGGAATT TAATCTCGAC
CCAAACGCGC AGCCTGCTGG CGCGCAGCGC ACACCCTACA GCAAGGCGCA GGCTGAGGTG
TTAGCCAAGT TTAAGCCCGA GGTGGTGAGT TTCCATTTTG GTTTGCCCGA TGAAGAGTTG
CTGCTGGAAA TTAAATCTTG GGGCTCAAAA GTTATCTCCA CGGCGACCAC AGTCGAAGAA
GCGCTCTGGC TCGAAGCCCG TGGCGCGGAT GCGATTATTG CCCAAGGTTT AGAGGCTGGA
GGCCACAGAG GGCACTTTTT ATCCGAGGAT TTAACCGAGC AGCTCGGCAC TTTTAGTCTA
TTACCACAGA TTATTGCGGC GGTGGAGATT CCCGTGATAG CCGCAGGCGG CATAGTCGAT
GCCACCACGG TTCGGGCGGC AATGACAATG GGCGCTTCGG CCGTGCAAGT GGGGACGGCT
TATTTGCTCT GTCCAGAATG TAATACCAGT GCAATCCATC GTGAGGCGTT GCAAAGTGAC
GCTGCGCAAC ATACGGCACT GACTAATTTA TTTTCCGGTA GACCTGCGCG TGGCATAGTG
AACCGTTTTA TGGCAGAAAT GGGACCGATG AATGAGGCTG TGCCTGATTT CCCCTTGGCA
TCCTCGGCGG TTGCAGGCTT AAGGACAGCG GCGGAGCGAC TAGGATTTTG GGATTTTAGT
CCGCTATGGT GCGGGCAGAA TGCCAGTGGG TGCCGAGCGA TCCCTGCCGC AGATTTGACT
AGAAGCTTTG TGCTAAGCTT GCCCTCATCT TGCGTTGAGC CGCAAGAAAA GTCTGGCTAA
 
Protein sequence
MPCRLTRLFG IEFPIIQAPM AGVQGSALAI AVSEAGGLGS LPCAMLSLEA LEAELTAIRS 
QTAKPINVNF FCHREPVAQA AKQAAWLEQL APYFAEFNLD PNAQPAGAQR TPYSKAQAEV
LAKFKPEVVS FHFGLPDEEL LLEIKSWGSK VISTATTVEE ALWLEARGAD AIIAQGLEAG
GHRGHFLSED LTEQLGTFSL LPQIIAAVEI PVIAAGGIVD ATTVRAAMTM GASAVQVGTA
YLLCPECNTS AIHREALQSD AAQHTALTNL FSGRPARGIV NRFMAEMGPM NEAVPDFPLA
SSAVAGLRTA AERLGFWDFS PLWCGQNASG CRAIPAADLT RSFVLSLPSS CVEPQEKSG