Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_1350 |
Symbol | |
ID | 4251369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 1563755 |
End bp | 1566829 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638117934 |
Product | amidohydrolase |
Protein accession | YP_733485 |
Protein GI | 113969692 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.535703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.023067 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGATAA ACAATAAGAT AATAATAAGC TGCGGCATTA TAAGTAGTTT GGTATCCCAC AACATTTTTG CCGAGGCCAA CCCCATCTCC CCCGTCAATA AGTTAACGGC CTTCACTAAT GCTGAATTAA TCATGGCGCC AGGGAAGCGA CTCACAAATG CCACCTTATT AGTAGAAAAT AATCGGATCA AAGCCATTAT TGAAAAAGGG GATATTCCAG AGGCAGCATT AAAGGTTGAT CTCAGTGGCT ATACTATCTA TCCCGGTTTT ATCGATCCCT TTACCGATTA TGGTATCGAG TTTGAATATC CCAAATTGGG GCTGACTCGT CCGGTATATG ATATTAAACG TATCGGTGGT AATGCCGAAA ATGGTGCAAT TCACGCTGAA AAAGAATGGT TTAATTACGT TTACCCAAAT AAAGAACGCG CACGTGATTG GATCAATAAC GGCTTTACCA GCGTGCAAAG CAGTAAACTC GACGGTATTT TCCGCGGTAC AGGTGTTAGC CTTTCATTAG CCGATAAAAC CGCCAACGAG GTGATTTATC GTGCACGCAG CCAACCTTTT ATGGCCTTCG ATAAAGGCAC GTCTGAACAA GACTACCCCA ATTCTCTGAT GGGCAGTATC GCGCTTATCC GACAAACCTT TGCCGATGCT AACTGGTATA ACCAAAATAA GCACAAATCG GTTAATAGTT CAGCAAATGC TCAAATTGAA TTTAATATCG CCTTCGAACG CTTAGATAAC TTAGCCGAGA AACAAATCGT CTTTGAAACA AAAAACCTCA ACGATTTGCT GCGCGCCGCG AACCTGCTCA AAGAACATCA GCAGCCAGCC AACTTACTCG CAAGTGGTCA AGAATATGCG CGCATCAATG AATTACAGAC CCTGAATTAT CCTCTCATCC TGCCGCTCAA CTATCCGCAA GCACCGGATG TGGGTACCGA TGATGCCGAC CGTGAAGTCT CCCTTGCAGA ACTCAGACAA TGGGAACGCG CCCCCACAAA CCCAGCAGCA GTGGCTAATG CGGGCATCCC TTTTGCGTTT ACCCAATTTG GCATTAAGAC CGAGGCATTT TGGCCTCGCC TGCGTCAAGC CATCGCCCAG GGACTGAGTG AGGACAAGGC CCTCGCCGCA CTGACAACTC AAGCCGCCGA GATGGCGGGT ACGGCCGAGT TCGCCGGTAA ACTTGCTCCC GGCTATATGG CCGACTTTGT GATTACTAAG GGCAATATCT TTAAAGATGG CCAGATTTAC AGTGTCTGGC TGCAGGGCAA AGAGCAGTCC ATTCGCTCAC TCCCTCAGGC CAAACTCTTG GGCGATTATC AGCTCACGCT GAATAATCTC ACCTTAGATC TAAGCCTTGA AGAAACGGGC AAAGCGGGCA AAACCGCCTT CCAAGGTCAG CTGAGTAGTG GCGAGAAGAG CATTACACTC ACCAATCTTC AGCTTGAAGA CGACGGCCGC GTGAGCTTTA ACGCCGATTT AAGCGATGCG GGCATACATG GCATCAGCCG TTTTACCCTC TGGCTCGCTA AAGATGGTAT TCAAGGCCGC ATGGTGGACG CCCAAAGTCG CAGTATTAAT GTGGCGGGTA TTGCTATCGC TTCGAGTCCA AAGACGGAGC AAGAAGGCAC GTCTAAAACC GATTCCGCAC CAACCTTAGT CAGTCAACTC ACCTACCCTA ACGTCGCCTA TGGTTTGAGC GAGGCGCCAA AAGCCGAAAA ACTCCATATT AAAAATGCCA CCCTGTGGAC CTCGGATAAA CAGGGGATTT TGGAACATGC GGATCTGTTA ATGGCCAATG GCCGAATTGA AAAAATCGGT CAGCAACTCA GCACGCCTTC GGGTTATCAA GTTCTCGATG CAACGGGTAA ACACTTAACT GCGGGTATCG TCGATGAACA TTCCCATATT GCTATCAATG GTGGCACGAA TGAAGGTACA GATGCAGTCA CCTCTGAGGT GCGGATTGGC GATGTGATCA ATCCCGAAGA TATCTCCATC TACCGTGCAC TCGCGGGCGG CGTCACCAGC GCACAATTAC TCCATGGCAG CGCTAACCCA ATTGGTGGCC AATCCCAGTT AATCAAGATG AAATGGGGTG AGAGTGCTGA ACAGCTTAAA TTTGCCAATG CCCCTGCCAG TATCAAATTC GCCCTTGGCG AAAACGTCAA ACAGAGTAAC TGGGGCGAGA AATTTGTTCA GCGCTTCCCG CAGACTCGCA TGGGCGTAAA AGCTTTATTT GAAGAAACCT TCGATGCCGC AATCACCTAT GAGAAAGCGC TGAAGGACTA CGATGACTTA CGTAGCAGTG AGAAGAAGAA AACCATTGCG CCGCGCCCAA GCTATCGCCT ACAGGCGGTA GCCGAAGTGT TAAAGCAGCA GCGCGATGTG CATATCCACT CCTACGTGCA GTCAGAAATC TTAATGTTCC TGCGATTAGC CGAAGCCTAT CACTTTAAGG TGCAAACCTT TACCCACGTG CTCGAAGGTT ACAAGGTTGC CAGCGAGCTA GCCGCCCATG GTGCAGGCGC CTCGACCTTT GCCGATTGGT GGGCCTATAA GTTCGAAGTC TATGATGCGA TTCCACAAAA CGCGTGCCTG ATGCAAAAAC AAGGCGTGCT TACCAGCATC AACTCTGACG ACAACGAAAT GCAGCGCAGA CTCAACCAAG AAGCGGCTAA GTCAATGATG TATTGCGGTA TGTCTAAAGA AGACGCCTGG AATATGGTGA CCATCAACCC AGCGAAGCAG CTCAGAGTCG ATGAGTATGT TGGCTCACTC ACCCCAGGCA AAATGGCCGA TATCGTGCTT TGGAACGCCG AGCCGCTGTC GATTTACGCC AAGGTGACCC AGGCTTGGGT CGAAGGCAAA CGCTACTTCG ATCGCGACCA AGACCAGCTT GCCCAGCAGC AGGTCGTCGC AGAGCGTGCA GCCCTTATCC AAAAAATTCT CAGTAGTGAT GATAACGCCA AGGGGGGTGA GAAAGTCACT CCGCTTAAGG AACCTCAATG GCACTGCGAT ACCCATTATC AGGCTTGGGG CCAACATCAT CAGGGAGCAA AATAA
|
Protein sequence | MQINNKIIIS CGIISSLVSH NIFAEANPIS PVNKLTAFTN AELIMAPGKR LTNATLLVEN NRIKAIIEKG DIPEAALKVD LSGYTIYPGF IDPFTDYGIE FEYPKLGLTR PVYDIKRIGG NAENGAIHAE KEWFNYVYPN KERARDWINN GFTSVQSSKL DGIFRGTGVS LSLADKTANE VIYRARSQPF MAFDKGTSEQ DYPNSLMGSI ALIRQTFADA NWYNQNKHKS VNSSANAQIE FNIAFERLDN LAEKQIVFET KNLNDLLRAA NLLKEHQQPA NLLASGQEYA RINELQTLNY PLILPLNYPQ APDVGTDDAD REVSLAELRQ WERAPTNPAA VANAGIPFAF TQFGIKTEAF WPRLRQAIAQ GLSEDKALAA LTTQAAEMAG TAEFAGKLAP GYMADFVITK GNIFKDGQIY SVWLQGKEQS IRSLPQAKLL GDYQLTLNNL TLDLSLEETG KAGKTAFQGQ LSSGEKSITL TNLQLEDDGR VSFNADLSDA GIHGISRFTL WLAKDGIQGR MVDAQSRSIN VAGIAIASSP KTEQEGTSKT DSAPTLVSQL TYPNVAYGLS EAPKAEKLHI KNATLWTSDK QGILEHADLL MANGRIEKIG QQLSTPSGYQ VLDATGKHLT AGIVDEHSHI AINGGTNEGT DAVTSEVRIG DVINPEDISI YRALAGGVTS AQLLHGSANP IGGQSQLIKM KWGESAEQLK FANAPASIKF ALGENVKQSN WGEKFVQRFP QTRMGVKALF EETFDAAITY EKALKDYDDL RSSEKKKTIA PRPSYRLQAV AEVLKQQRDV HIHSYVQSEI LMFLRLAEAY HFKVQTFTHV LEGYKVASEL AAHGAGASTF ADWWAYKFEV YDAIPQNACL MQKQGVLTSI NSDDNEMQRR LNQEAAKSMM YCGMSKEDAW NMVTINPAKQ LRVDEYVGSL TPGKMADIVL WNAEPLSIYA KVTQAWVEGK RYFDRDQDQL AQQQVVAERA ALIQKILSSD DNAKGGEKVT PLKEPQWHCD THYQAWGQHH QGAK
|
| |