Gene Ssed_2687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_2687 
Symbol 
ID5611834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp3243301 
End bp3244341 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content46% 
IMG OID640933606 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001474422 
Protein GI157375822 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.686037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGCG AACAAAATCC ACTGGGTTTA TTGGGCATAG AATTTACAGA ATTTGCCACC 
CCAGATCTTG ATTTCATGCA TCAAGTGTTT ATTGATTTCG GTTTTTCTAA GCTAAAGAAA
AGTAAAACTA AAGACATTAG CTACTACAAG CAGAACGACA TTAACTTTTT GCTGAACAAT
GAAGTTCGCG GCTTTTCGGC AGAGTTCGCT AAGAGTCACG GCCCCGCGAT CTGTTCGATG
GGCTGGCGTG TAGAAGATGC CCAGTTTGCT TTCGAAGGCG CAGTGGCACG CGGTGCTAAA
CCTGCAACAG AAGAAAATAA AGACCATCCT TACCCCGCCA TTTACGGTAT TGGCGACAGC
CTGATCTACT TTATCGACCT GTTCGGCAGT GAAAGTAATA TCTACCAGAA TGATTTCGTC
GATCTTGAAG AGCCTGTGAT CACTCAGGAG AAAGGCTTTA TCGAAGTCGA TCACCTGACC
AACAATGTTT ACAAAGGGAC GATGGAACAT TGGGCCAACT TCTACAAAGA TATCTTTGGT
TTCACCGAAG TGCGCTACTT CGATATTAAA GGGGCCCAAA CGGCCTTAAT CTCTTATGCG
CTACGTTCAC CCGACGGCAG CTTCTGTATC CCGATTAATG AAGGCAAAGG CAGTGACAAG
AATCAGATCG ATGAATACCT CAGAGAGTAT GATGGACCCG GCGTTCAACA CCTGGCATTC
AGAAGCCGGG ATATCGTCGC GTCACTAGAT GCGATGGAAG GTTCATCGAT AAAAACTTTA
GATATCATCC CCGAGTATTA CGACACCATC TTCGAAAAAT TGCCACAGGT GACAGAAGAC
AGAGAAAAAA TTAAACATCA TCAGATCTTA GTCGATGGCG ACGAAGAGGG TTACCTGTTA
CAGATCTTCA CTAAGAACCT GTTCGGCCCC ATCTTTATCG AGATCATTCA GCGCAAGAAT
AACCTGGGAT TCGGAGAGGG TAACTTTACC GCCCTGTTCC AGTCTATCGA ACGGGATCAA
CAGCGCCGCG GTGTGCTGTA A
 
Protein sequence
MASEQNPLGL LGIEFTEFAT PDLDFMHQVF IDFGFSKLKK SKTKDISYYK QNDINFLLNN 
EVRGFSAEFA KSHGPAICSM GWRVEDAQFA FEGAVARGAK PATEENKDHP YPAIYGIGDS
LIYFIDLFGS ESNIYQNDFV DLEEPVITQE KGFIEVDHLT NNVYKGTMEH WANFYKDIFG
FTEVRYFDIK GAQTALISYA LRSPDGSFCI PINEGKGSDK NQIDEYLREY DGPGVQHLAF
RSRDIVASLD AMEGSSIKTL DIIPEYYDTI FEKLPQVTED REKIKHHQIL VDGDEEGYLL
QIFTKNLFGP IFIEIIQRKN NLGFGEGNFT ALFQSIERDQ QRRGVL