Gene Shewmr4_1350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1350 
Symbol 
ID4251369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1563755 
End bp1566829 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content49% 
IMG OID638117934 
Productamidohydrolase 
Protein accessionYP_733485 
Protein GI113969692 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.535703 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.023067 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATAA ACAATAAGAT AATAATAAGC TGCGGCATTA TAAGTAGTTT GGTATCCCAC 
AACATTTTTG CCGAGGCCAA CCCCATCTCC CCCGTCAATA AGTTAACGGC CTTCACTAAT
GCTGAATTAA TCATGGCGCC AGGGAAGCGA CTCACAAATG CCACCTTATT AGTAGAAAAT
AATCGGATCA AAGCCATTAT TGAAAAAGGG GATATTCCAG AGGCAGCATT AAAGGTTGAT
CTCAGTGGCT ATACTATCTA TCCCGGTTTT ATCGATCCCT TTACCGATTA TGGTATCGAG
TTTGAATATC CCAAATTGGG GCTGACTCGT CCGGTATATG ATATTAAACG TATCGGTGGT
AATGCCGAAA ATGGTGCAAT TCACGCTGAA AAAGAATGGT TTAATTACGT TTACCCAAAT
AAAGAACGCG CACGTGATTG GATCAATAAC GGCTTTACCA GCGTGCAAAG CAGTAAACTC
GACGGTATTT TCCGCGGTAC AGGTGTTAGC CTTTCATTAG CCGATAAAAC CGCCAACGAG
GTGATTTATC GTGCACGCAG CCAACCTTTT ATGGCCTTCG ATAAAGGCAC GTCTGAACAA
GACTACCCCA ATTCTCTGAT GGGCAGTATC GCGCTTATCC GACAAACCTT TGCCGATGCT
AACTGGTATA ACCAAAATAA GCACAAATCG GTTAATAGTT CAGCAAATGC TCAAATTGAA
TTTAATATCG CCTTCGAACG CTTAGATAAC TTAGCCGAGA AACAAATCGT CTTTGAAACA
AAAAACCTCA ACGATTTGCT GCGCGCCGCG AACCTGCTCA AAGAACATCA GCAGCCAGCC
AACTTACTCG CAAGTGGTCA AGAATATGCG CGCATCAATG AATTACAGAC CCTGAATTAT
CCTCTCATCC TGCCGCTCAA CTATCCGCAA GCACCGGATG TGGGTACCGA TGATGCCGAC
CGTGAAGTCT CCCTTGCAGA ACTCAGACAA TGGGAACGCG CCCCCACAAA CCCAGCAGCA
GTGGCTAATG CGGGCATCCC TTTTGCGTTT ACCCAATTTG GCATTAAGAC CGAGGCATTT
TGGCCTCGCC TGCGTCAAGC CATCGCCCAG GGACTGAGTG AGGACAAGGC CCTCGCCGCA
CTGACAACTC AAGCCGCCGA GATGGCGGGT ACGGCCGAGT TCGCCGGTAA ACTTGCTCCC
GGCTATATGG CCGACTTTGT GATTACTAAG GGCAATATCT TTAAAGATGG CCAGATTTAC
AGTGTCTGGC TGCAGGGCAA AGAGCAGTCC ATTCGCTCAC TCCCTCAGGC CAAACTCTTG
GGCGATTATC AGCTCACGCT GAATAATCTC ACCTTAGATC TAAGCCTTGA AGAAACGGGC
AAAGCGGGCA AAACCGCCTT CCAAGGTCAG CTGAGTAGTG GCGAGAAGAG CATTACACTC
ACCAATCTTC AGCTTGAAGA CGACGGCCGC GTGAGCTTTA ACGCCGATTT AAGCGATGCG
GGCATACATG GCATCAGCCG TTTTACCCTC TGGCTCGCTA AAGATGGTAT TCAAGGCCGC
ATGGTGGACG CCCAAAGTCG CAGTATTAAT GTGGCGGGTA TTGCTATCGC TTCGAGTCCA
AAGACGGAGC AAGAAGGCAC GTCTAAAACC GATTCCGCAC CAACCTTAGT CAGTCAACTC
ACCTACCCTA ACGTCGCCTA TGGTTTGAGC GAGGCGCCAA AAGCCGAAAA ACTCCATATT
AAAAATGCCA CCCTGTGGAC CTCGGATAAA CAGGGGATTT TGGAACATGC GGATCTGTTA
ATGGCCAATG GCCGAATTGA AAAAATCGGT CAGCAACTCA GCACGCCTTC GGGTTATCAA
GTTCTCGATG CAACGGGTAA ACACTTAACT GCGGGTATCG TCGATGAACA TTCCCATATT
GCTATCAATG GTGGCACGAA TGAAGGTACA GATGCAGTCA CCTCTGAGGT GCGGATTGGC
GATGTGATCA ATCCCGAAGA TATCTCCATC TACCGTGCAC TCGCGGGCGG CGTCACCAGC
GCACAATTAC TCCATGGCAG CGCTAACCCA ATTGGTGGCC AATCCCAGTT AATCAAGATG
AAATGGGGTG AGAGTGCTGA ACAGCTTAAA TTTGCCAATG CCCCTGCCAG TATCAAATTC
GCCCTTGGCG AAAACGTCAA ACAGAGTAAC TGGGGCGAGA AATTTGTTCA GCGCTTCCCG
CAGACTCGCA TGGGCGTAAA AGCTTTATTT GAAGAAACCT TCGATGCCGC AATCACCTAT
GAGAAAGCGC TGAAGGACTA CGATGACTTA CGTAGCAGTG AGAAGAAGAA AACCATTGCG
CCGCGCCCAA GCTATCGCCT ACAGGCGGTA GCCGAAGTGT TAAAGCAGCA GCGCGATGTG
CATATCCACT CCTACGTGCA GTCAGAAATC TTAATGTTCC TGCGATTAGC CGAAGCCTAT
CACTTTAAGG TGCAAACCTT TACCCACGTG CTCGAAGGTT ACAAGGTTGC CAGCGAGCTA
GCCGCCCATG GTGCAGGCGC CTCGACCTTT GCCGATTGGT GGGCCTATAA GTTCGAAGTC
TATGATGCGA TTCCACAAAA CGCGTGCCTG ATGCAAAAAC AAGGCGTGCT TACCAGCATC
AACTCTGACG ACAACGAAAT GCAGCGCAGA CTCAACCAAG AAGCGGCTAA GTCAATGATG
TATTGCGGTA TGTCTAAAGA AGACGCCTGG AATATGGTGA CCATCAACCC AGCGAAGCAG
CTCAGAGTCG ATGAGTATGT TGGCTCACTC ACCCCAGGCA AAATGGCCGA TATCGTGCTT
TGGAACGCCG AGCCGCTGTC GATTTACGCC AAGGTGACCC AGGCTTGGGT CGAAGGCAAA
CGCTACTTCG ATCGCGACCA AGACCAGCTT GCCCAGCAGC AGGTCGTCGC AGAGCGTGCA
GCCCTTATCC AAAAAATTCT CAGTAGTGAT GATAACGCCA AGGGGGGTGA GAAAGTCACT
CCGCTTAAGG AACCTCAATG GCACTGCGAT ACCCATTATC AGGCTTGGGG CCAACATCAT
CAGGGAGCAA AATAA
 
Protein sequence
MQINNKIIIS CGIISSLVSH NIFAEANPIS PVNKLTAFTN AELIMAPGKR LTNATLLVEN 
NRIKAIIEKG DIPEAALKVD LSGYTIYPGF IDPFTDYGIE FEYPKLGLTR PVYDIKRIGG
NAENGAIHAE KEWFNYVYPN KERARDWINN GFTSVQSSKL DGIFRGTGVS LSLADKTANE
VIYRARSQPF MAFDKGTSEQ DYPNSLMGSI ALIRQTFADA NWYNQNKHKS VNSSANAQIE
FNIAFERLDN LAEKQIVFET KNLNDLLRAA NLLKEHQQPA NLLASGQEYA RINELQTLNY
PLILPLNYPQ APDVGTDDAD REVSLAELRQ WERAPTNPAA VANAGIPFAF TQFGIKTEAF
WPRLRQAIAQ GLSEDKALAA LTTQAAEMAG TAEFAGKLAP GYMADFVITK GNIFKDGQIY
SVWLQGKEQS IRSLPQAKLL GDYQLTLNNL TLDLSLEETG KAGKTAFQGQ LSSGEKSITL
TNLQLEDDGR VSFNADLSDA GIHGISRFTL WLAKDGIQGR MVDAQSRSIN VAGIAIASSP
KTEQEGTSKT DSAPTLVSQL TYPNVAYGLS EAPKAEKLHI KNATLWTSDK QGILEHADLL
MANGRIEKIG QQLSTPSGYQ VLDATGKHLT AGIVDEHSHI AINGGTNEGT DAVTSEVRIG
DVINPEDISI YRALAGGVTS AQLLHGSANP IGGQSQLIKM KWGESAEQLK FANAPASIKF
ALGENVKQSN WGEKFVQRFP QTRMGVKALF EETFDAAITY EKALKDYDDL RSSEKKKTIA
PRPSYRLQAV AEVLKQQRDV HIHSYVQSEI LMFLRLAEAY HFKVQTFTHV LEGYKVASEL
AAHGAGASTF ADWWAYKFEV YDAIPQNACL MQKQGVLTSI NSDDNEMQRR LNQEAAKSMM
YCGMSKEDAW NMVTINPAKQ LRVDEYVGSL TPGKMADIVL WNAEPLSIYA KVTQAWVEGK
RYFDRDQDQL AQQQVVAERA ALIQKILSSD DNAKGGEKVT PLKEPQWHCD THYQAWGQHH
QGAK