Gene Sama_1569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1569 
Symbol 
ID4603821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1913337 
End bp1914497 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content54% 
IMG OID639780925 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_927446 
Protein GI119774706 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTTT ATGTGAAGCA AGGTCAGATC CCGCACAAGC GCCATATCGC GTTTGCCAAG 
GAAAATGGAG AGCTTTACCG CGAAGAGCTG TTCTCTACCC ACGGGTTTTC GAACATCTAT
TCCAATAAGT ACCACCACAA TATGCCCACC AAGGCGCTGG AAGTGGCCCC TATGTCACTG
ACCCACGGCG ACACCTGGGT CGATTCGCTG GTACAGAACT ATAAGCTGGA CTCACGTCTG
GCCGATGGTG AAGGCAATTT CTTCTCTGCC CGCAACAAGA TTTTCTTTAA CGCAGATGTC
GCGCTCTACA CCGCCAAGGT GACTGCCGAC ACCGACGAGT TTTACCGTAA CGCCTATGCC
GATGAAGTGG TGTTTGTACA CGAAGGCGAC GGCGTGCTCT TGTCTGAATA CGGCGAGCTT
GAAGTCAAGA AGTGGGACTA TCTGGTCATT CCGCGCGGCA CGACTTATCA GCTGAAGTTC
AACGATTACA GCAATGTACG CCTGTTCGTG ATTGAATCTT TCACCATGGT GGAGATCCCT
AAGCATTTCC GTAACGAATA CGGTCAGCTA CTGGAATCTG CCCCTTACTG CGAGCGCGAT
ATCCGCGTCC CCGAGCTAAA AGACGCCGTG GTTGAGCGCG GAAACTTCCC GCTGGTGTGT
AAGTTTGGCG ATAAGTACCA ACTGACTCAG CTGGAGTGGC ATCCCTTTGA TTTGGTGGGC
TGGGACGGTT TTGTCTATCC CTGGGCCTTC AACATCACCG AATACGCGCC CAAGGTGGGT
AAAATCCACC TGCCGCCGTC GGATCACCTG CTGTTTGTGG CCAGAAACTT CGTTATCTGC
AACTTTGTAC CGCGCCCCTA TGATTTCCAC CCGCAGGCGA TCCCGGCCCC GTACTACCAT
AACAACATCG ACAGTGACGA AGTGCTCTAT TACGTCGACG GTGATTTTAT GAGCCGCACC
GGCATTGAAG CCGGTTACAT CACCCTGCAC CAGAAAGGCG TGGCCCACGG CCCGCAACCC
GGCCGCACCG AGGCCTCCAT AGGCAAGAAG GACACCTACG AGTACGCCGT GATGGTAGAC
ACCTTTGCGC CACTGCAGCT GACCGAACAC GTGAAAAACT GCATGGCACC CGATTACAAC
CGCTCCTGGC TCGAAGACTA A
 
Protein sequence
MPFYVKQGQI PHKRHIAFAK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPMSL 
THGDTWVDSL VQNYKLDSRL ADGEGNFFSA RNKIFFNADV ALYTAKVTAD TDEFYRNAYA
DEVVFVHEGD GVLLSEYGEL EVKKWDYLVI PRGTTYQLKF NDYSNVRLFV IESFTMVEIP
KHFRNEYGQL LESAPYCERD IRVPELKDAV VERGNFPLVC KFGDKYQLTQ LEWHPFDLVG
WDGFVYPWAF NITEYAPKVG KIHLPPSDHL LFVARNFVIC NFVPRPYDFH PQAIPAPYYH
NNIDSDEVLY YVDGDFMSRT GIEAGYITLH QKGVAHGPQP GRTEASIGKK DTYEYAVMVD
TFAPLQLTEH VKNCMAPDYN RSWLED