Gene Shewmr4_2604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2604 
Symbol 
ID4253175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3099761 
End bp3100747 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content51% 
IMG OID638119239 
Productfumarylacetoacetate (FAA) hydrolase 
Protein accessionYP_734732 
Protein GI113970939 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000203014 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000504618 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACTTG CCAGTTATAA CAATGGTCGC CGTGATGGCC AGCTGATGTT AGTGAGCCGC 
GATCTTACTC AAACGGTTGC CGTACCCGCG ATTGCCCATA CGATGCAACA ATTACTCGAT
GGTTGGGATC TGCTCAAGCC ACAATTGCAA GAATTGTATG ATGCGCTGAA CGAAGGCAAA
TTACCAAACG CACAAGCCTT CGATGAAGCC AAATGTTTAT CACCTTTGCC ACGTGCGTAC
CAGTGGGCCG ATGGTAGCGC CTATGTTAAC CATGTGGAAT TAGTCCGTAA GGCGCGCGGC
GCTGAAATGC CAGAAACCTT CTGGACCGAT CCGCTATTTT ACCAAGGCGG CTCTGACAGC
TTTATCGCGC CAAAGGCGGA TATCTCGCTG GCGAGCGAAG ACTGGGGTAT CGATTTCGAA
TCGGAAATCG CCGTGATCAC CGATGATGTG CCTATGGGCG TGAGTGTTGA AAATGCTACG
TCACACATTA AGCTGTTGAT GTTAGTGAAC GACGTATCTC TGCGTAACCT GATCCCCGCA
GAGCTGGCGA AAGGTTTCGG TTTCTTCCAA TCCAAACCTT CGAGCAGCTT CTCACCTGTC
GCCATCACGC CAGATGAATT AGGCCACCGC TGGGAAGATT CAAAGGTGCA TTTACCGCTT
ATCACCCATT TAAATGGCAA ACTATTCGGT CGCCCGAATG CGGGCGTGGA TATGACCTTT
AACTTCAGTC AGTTAGTTTC TCATGTTGCT AAAACCCGTC CATTAGGCGC GGGCGCGATT
ATCGGTTCGG GTACGATTTC TAACTATGAC CGCAGTGCCG GCTCAAGCTG TTTGGCCGAG
AAACGTATGC TCGAAGTGAT CGCCGACGGC AAAGCCAGCA CGCCGTTTAT GCGTTTTGGC
GACACTGTGC GCATCGAAAT GCTCGATGAT AACGGCGCCT CTATTTTTGG CTCTATCGAT
CAAAAAGTGG TTGAGTACAA GGCGTAA
 
Protein sequence
MKLASYNNGR RDGQLMLVSR DLTQTVAVPA IAHTMQQLLD GWDLLKPQLQ ELYDALNEGK 
LPNAQAFDEA KCLSPLPRAY QWADGSAYVN HVELVRKARG AEMPETFWTD PLFYQGGSDS
FIAPKADISL ASEDWGIDFE SEIAVITDDV PMGVSVENAT SHIKLLMLVN DVSLRNLIPA
ELAKGFGFFQ SKPSSSFSPV AITPDELGHR WEDSKVHLPL ITHLNGKLFG RPNAGVDMTF
NFSQLVSHVA KTRPLGAGAI IGSGTISNYD RSAGSSCLAE KRMLEVIADG KASTPFMRFG
DTVRIEMLDD NGASIFGSID QKVVEYKA