Gene Sbal223_2865 details


Gene Information

Locus tag: Sbal223_2865
Symbol:
ID: 7089434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 3364528
End bp: 3365514
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 47%
IMG OID: 643461750
Product: fumarylacetoacetate (FAA) hydrolase
Protein accession: YP_002358774
Protein GI: 217974023
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID:
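
The start, end, gene length, and protein length fields above are mutually consistent. A minimal Python sketch of that check follows; it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3364528, 3365514)
print(length_bp, protein_length(length_bp))
```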


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0172279
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000624401
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACTTG CAAGTTATAA CAATGGTCGC CGCGATGGTC AGTTGATGTT AGTTAGCCGC 
GATCTCACTA AAACGGTTGC CGTTCCTGCT ATTGCCCACA CTATGCAGCA ATTACTCGAC
GGTTGGGATT TACTGAAACC GCAATTGCAA GAATTGTACG ACGCGTTAAA TGAAGGCATG
TTAGATAACG CGCAAGCGTT TGACGAAGCT AAGTGTTTGT CGCCACTACC GCGTGCCTAC
CAATGGGCCG ATGGCAGTGC TTATGTGAAT CACGTTGAGC TAGTCCGTAA GGCTCGCGGC
GCAGAAATGC CTGAAACCTT CTGGACAGAT CCGCTGTTCT ATCAAGGTGG TTCAGACAGT
TTTATTGCCC CTAAAGCCGA TATTCCACTG GCGAGTGAAG ACTGGGGTAT CGATTTCGAA
TCTGAAATCG CGATCATTAC CGATGATGTG CCTATGGGCG TGAGCAGCGA CAATGCGGCT
AAGCATATTA AATTGTTGAT GTTAGTGAAC GATGTGTCAC TGCGTAACTT AATCCCAGGT
GAGTTGGCGA AAGGGTTTGG TTTCTTCCAA TCTAAGCCAT CGAGCAGTTT TTCACCTGTG
GCAGTGACGC CGGATGAACT GGGCGCACGC TGGGAAGATT CGAAAGTGCA TTTACCGCTG
ATCACCCATT TAAACGGTGA GTTATTTGGT CGTCCAAATG CGGGCGTCGA TATGACCTTT
AATTTCAGCC AGTTAGTTTC ACATGTTGCA AAAACCCGTC CATTGGGTGC AGGCGCGATT
ATTGGCTCTG GTACTATTTC TAACTATGAC CGTAGCGCAG GTTCTAGCTG TTTAGCTGAA
AAACGTATGT TAGAAGTGAT TGCCGAAGGC AAAGCCACTA CACCATTTAT GCGTTTTGGC
GACACTGTGC GCATCGAAAT GCTTGATGAC AACAATGTAA CGATTTTCGG TTCTATCGAT
CAAAAAGTGG TTGAATACAA AGCCTAA
 
Protein sequence
MKLASYNNGR RDGQLMLVSR DLTKTVAVPA IAHTMQQLLD GWDLLKPQLQ ELYDALNEGM 
LDNAQAFDEA KCLSPLPRAY QWADGSAYVN HVELVRKARG AEMPETFWTD PLFYQGGSDS
FIAPKADIPL ASEDWGIDFE SEIAIITDDV PMGVSSDNAA KHIKLLMLVN DVSLRNLIPG
ELAKGFGFFQ SKPSSSFSPV AVTPDELGAR WEDSKVHLPL ITHLNGELFG RPNAGVDMTF
NFSQLVSHVA KTRPLGAGAI IGSGTISNYD RSAGSSCLAE KRMLEVIAEG KATTPFMRFG
DTVRIEMLDD NNVTIFGSID QKVVEYKA
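
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that in plain Python, under stated assumptions: translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons, and this gene starts with ATG), the input is the coding strand in frame, and translation stops at the first stop codon. The `translate` helper is hypothetical and is not a library function.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares; '*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base groups used in the
    listing above can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the gene sequence above.
print(translate("ATGAAACTTG CAAGTTAT"))  # MKLASY
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (spaces removed) is a quick consistency check on the record.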