Gene Shewmr4_0097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_0097 
Symbol 
ID4250976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp106986 
End bp108212 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content52% 
IMG OID638116639 
Productimidazolonepropionase 
Protein accessionYP_732235 
Protein GI113968442 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTGGG ATCAGGTTTG GATAGACGTT AACGTAGCAA CAATGGACCC TTCCATATCA 
GCACCTTACG GCGCAATTAC CAATGCAGCT ATCGCAGTAA AAGACGGTAA AATTGCCTGG
TTAGGCCCAC GCAGCGAGCT GCCCGCCTTC GATGTGTTGT CCATTCCTGT TTACAGGGGC
AAGGGCGGTT GGATCACTCC TGGGCTGATT GATGCCCACA CCCATCTGGT ATTTGCCGGT
AATCGTGCCA ACGAATTCGA GCTACGCCTA AAGGGCGCTA CCTATGAGGA AATCGCCCGT
GCTGGCGGCG GCATTATTTC CACGGTTAAC GCCTGCCGTG AGGCCGACGA AGCCGAGTTA
TTTGATCTCG GCCGCCAACG TTTAAATGCC TTGGCGAAGG AAGGTGTCAC TACGGTTGAG
ATTAAATCTG GCTATGGTCT AGATACCGAA ACCGAACTCA AAATCCTGCG TGTTGCCCGC
GAACTCGGCC AACATCACCA TGTGGATGTG AAGACCACCT TCCTCGGTGC CCATGCGGTG
CCGCCCGAGT TTAAAGACAA TAGCGACGGC TATGTCGACT TAATAATCAA TAAAATGCTG
CCTGCGGTGA TTGCCGAAAA TCTTGCCGAT GCGGTGGATG TATTCTGTGA AAACATCGCC
TTTAACCTAG AGCAAACCGA GCGCGTGCTG AGCGCCGCCA AAGCGGCTGG CCTGCAAGTC
AAACTGCACG CCGAGCAATT ATCCAATATG GGCGGCTCTG AATTAGCCGC ACGCTTAGGG
GCTAAGTCGG TTGATCATAT TGAATATTTA GATGAGGCTG GTGTTAAAGC CCTAAGTGAA
AGTGGCACCT GCGCCGTGCT GTTACCGGGC GCGTTTTACT TTTTGCGGGA AACCCAAAAA
CCACCTATCG ACTTATTGCG TCAATACGGT GTGCCTATGG TGCTCGCCAG CGACTTTAAT
CCCGGCTCAT CGCCCATCTG TTCGACCCTG CTGATGCTGA ACATGGGTTG CACCCTATTC
CGCTTAACAC CAGAGGAAGC GCTTGCGGGT TTAACATTGA ATGCCGCCAA GGCACTAGGG
ATTGAAGAGA ATGTCGGCAG CTTGGTGGTT GGTAAGCAGG CGGATTTCTG TCTGTGGGAT
ATCGCCACCC CGGCGCAACT CGCCTATAGC TACGGCGTGA ATCCCTGCAA GGATGTGGTG
AAAAACGGTA AGTTAGTGCA TCAATAA
 
Protein sequence
MSWDQVWIDV NVATMDPSIS APYGAITNAA IAVKDGKIAW LGPRSELPAF DVLSIPVYRG 
KGGWITPGLI DAHTHLVFAG NRANEFELRL KGATYEEIAR AGGGIISTVN ACREADEAEL
FDLGRQRLNA LAKEGVTTVE IKSGYGLDTE TELKILRVAR ELGQHHHVDV KTTFLGAHAV
PPEFKDNSDG YVDLIINKML PAVIAENLAD AVDVFCENIA FNLEQTERVL SAAKAAGLQV
KLHAEQLSNM GGSELAARLG AKSVDHIEYL DEAGVKALSE SGTCAVLLPG AFYFLRETQK
PPIDLLRQYG VPMVLASDFN PGSSPICSTL LMLNMGCTLF RLTPEEALAG LTLNAAKALG
IEENVGSLVV GKQADFCLWD IATPAQLAYS YGVNPCKDVV KNGKLVHQ