Gene SO_1963 details

Gene Information

Locus tag: SO_1963
Symbol:
ID: 1169723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella oneidensis MR-1
Kingdom: Bacteria
Replicon accession: NC_004347
Strand:
Start bp: 2068671
End bp: 2069831
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 48%
IMG OID: 637343846
Product: hypothetical protein
Protein accession: NP_717570
Protein GI: 24373527
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID:
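
The coordinate and length fields in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Illustrative consistency check for the record above.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2068671, 2069831)
print(length)                  # 1161
print(protein_length(length))  # 386
```

Both values agree with the Gene Length and Protein Length fields listed above.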


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATTTT ATGTGAAACA AGGCCAAGTA CCCCATAAGC GCCATATCGC ATTTGAGAAG 
GAAAACGGCG AGCTATACCG TGAAGAGCTG TTTTCAACCC ATGGTTTTTC CAATATTTAT
TCCAATAAAT ATCACCACAA TATGCCGACG AAGGCGTTAG AAGTCGCCCC TTACCGCCTC
GGTCACGGTG CTCATTGGGA AGACTCATTA GTTCAAAACT ATAAACTGGA TTCCCGCTCA
GCCGATCGTG AAGGCAACTT CTTCAGTGCC CGCAATAAAA TCTTCTATAA CAATGATGTG
GCAATTTACA CCGCAAAAGT CACCCAAGAC ACCGCTGAGT TTTACCGTAA TGCCTACGCC
GATGAAGTGG TATTTGTGCA CGAAGGTGAA GGCACACTGT ATAGCGAGTA CGGAACCTTA
GAGATTAAAA AATGGGATTA TTTAGTGATC CCACGCGGCA CCACACATCA ACTTAAATTC
AACAATTACA GTAATGTGCG TTTGTTTGTG ATTGAAGCCT TTTCAATGGT GGAAGTGCCT
AAGCATTGCC GTAATGAATA CGGCCAGTTA CTCGAATCCG CACCCTACTG TGAACGCGAT
CTGCGCACGC CCATTTTGCA AGCTGCTGTG GTTGAGCGTG GTGCCTTCCC GCTAGTATGT
AAATTTGGCG ATAAATACCA GCTGACAACG CTAGAGTGGC ATCCATTTGA TTTAGTGGGC
TGGGATGGTT GTGTTTACCC TTGGGCATTT AACATCACCG AATACGCGCC CAAGGTTGGA
AAAATCCATT TACCGCCTTC GGACCATCTC GTGTTTACCG CCCACAACTT TGTGGTGTGT
AACTTTGTGC CGCGCCCCTA TGATTTCCAC GAGCGAGCCA TTCCCGCGCC TTACTATCAC
AACAACATCG ATAGTGACGA AGTGCTGTAC TACGTTGATG GCGACTTTAT GAGCCGCACC
GGGATTGAAG CAGGCTACAT CACCTTACAC CAAAAAGGCG TAGCGCACGG CCCACAGCCC
GGCCGCACCG AAGCCTCGAT TGGCAAAAAA GAAACCTATG AATATGCAGT GATGGTGGAC
ACCTTCGCCC CACTGAAATT AACCGAACAT GTGCAGCATT GCATGAGTAA AGACTACAAC
CGCTCCTGGC TAGAAAACTA A
 
Protein sequence
MPFYVKQGQV PHKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYRL 
GHGAHWEDSL VQNYKLDSRS ADREGNFFSA RNKIFYNNDV AIYTAKVTQD TAEFYRNAYA
DEVVFVHEGE GTLYSEYGTL EIKKWDYLVI PRGTTHQLKF NNYSNVRLFV IEAFSMVEVP
KHCRNEYGQL LESAPYCERD LRTPILQAAV VERGAFPLVC KFGDKYQLTT LEWHPFDLVG
WDGCVYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVVC NFVPRPYDFH ERAIPAPYYH
NNIDSDEVLY YVDGDFMSRT GIEAGYITLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD
TFAPLKLTEH VQHCMSKDYN RSWLEN
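
The protein sequence follows from the gene sequence by translation, and the GC content field is a simple base count. The following is a minimal sketch in plain Python, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may initiate translation), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `clean`, `translate`, and `gc_content` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino-acid assignments are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGCCATTTT ATGTGAAACA AGGCCAAGTA CCCCATAAGC GCCATATCGC ATTTGAGAAG"
print(translate(first_line))  # MPFYVKQGQVPHKRHIAFEK
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the remaining residues and the GC content field to be checked the same way.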