Gene Information |
Locus tag | Shewmr4_3421 |
Symbol | |
ID | 4253987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 4090039 |
End bp | 4092891 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 638120059 |
Product | excinuclease ABC subunit A |
Protein accession | YP_735544 |
Protein GI | 113971751 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
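
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record, although the listed values are consistent with both. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4090039
end_bp = 4092891
gene_length_bp = 2853
protein_length_aa = 950

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # span matches the listed gene length
print(span % 3 == 0)                                 # a CDS should be a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus the stop codon
```

Under these assumptions all three checks print `True` for this locus.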
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00603968 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAGA TTGAAATACG CGGCGCACGG ACCCACAATC TCAAAAATAT CAACCTGACT ATCCCAAGGG ATAAGTTAAT CGTTATCACA GGGTTATCGG GTTCAGGCAA ATCTTCCCTA GCATTTGATA CCTTATATGC CGAAGGCCAA CGACGTTACG TTGAGTCTCT TTCTGCCTAT GCCCGCCAAT TCTTAAGCTT GATGGAAAAG CCCGATGTCG ACCATATCGA AGGCTTGAGC CCGGCAATTT CCATCGAACA AAAATCGACG TCCCATAACC CGCGCTCCAC CGTCGGGACT ATTACCGAAA TTTACGACTA TCTGCGTCTG TTGTTTGCCC GTGTGGGCGA ACCGCGCTGC CCGACACACG GCCAACCCCT TGCGGCGCAA ACCGTGAGTC AGATGGTCGA TAAAGTGTTA GAAATGCCCG AAGACAGCCG CTTGATGCTG CTGGCGCCCG TAGTCAACGG TCGTAAGGGT GAGCACGTTA AACTACTCGA AGGTTTATCG GCGCAGGGCT ATATCCGCGC CCGAATCGAT GGCGAAGTCT GCGACTTAAC CGATCCACCA ACACTCGATT TACACGTTAA GCACACCATC GAAGTGGTGG TCGACCGCTT TAAAGTCCGC AGCGATATCC AGCAGCGTCT CGCCGAATCC TTTGAAACCG CGCTCGAACT CTCCGGCGGG ATTGCCGTGG TCGCCAGCAT GGATGAAGGC AACACTGAAG AATTAATCTT CTCTGCAAAT TTTGCCTGTC CGCACTGCGG TTATTCAATG GCAGAGCTTG AGCCACGGAT TTTCTCCTTT AACAACCCGG CGGGCGCTTG CCCGACCTGT GATGGTTTAG GGGTACAACA GTTTTTCGAC CCCGACAGAG TGATCACTAA TCCTGAGTTA TCCCTTGCGG GCGGCGCGAT TCGTGGCTGG GATAGACGTA ACTTCTATTA CTTTCAGATG CTGAGTTCGC TCGCCGACCA CTATAAGTTC GATGTCGAAA TTCCCTTCGA ACAGCTGTCG GATAAGGTGA GAAAAATCGT CCTCTACGGC TCAGGCAAAG ACAGCATCGC CTTCAAATAC ATCAACGACC GTGGTGATGT GGTGGTGCGC ACTCACCCCT TCGAAGGCAT CTTAAATAAT ATGGACCGGC GCTACCGCGA GACCGAAAGC AACGCGGTGC GCGAAGAATT AGCCAAGTTT ATCAATACTC AAGCCTGCCA AAGCTGTGGC GGGTCGCGCT TGCGCGAAGA AGCCCGTAAC GTGTTTATTG GCGATCTTAA CCTGCCAAAA CTCACCGTTT GGTCGATAGG TGAAGCCCTA GACTACTTCG ACAAACTCGA GTTCAGCGGC CAAAAGGCAC AGATCGCCGA GAAGGTACTG AAGGAAGTGC GCGATCGCTT AGGCTTCCTA GTTAACGTCG GCCTCAACTA TTTAAGTCTG TCGCGTTCGG CCGAAACCCT CTCAGGCGGT GAAGCGCAGC GTATTCGTCT CGCCAGCCAA ATCGGCGCAG GACTCGTCGG CGTGATGTAT GTGCTTGACG AGCCTTCTAT CGGCTTGCAC CAGCGAGATA ACGAACGGTT ATTGCAGACC TTAATCCACC TGCGGGATTT AGGTAATACA GTGATCGTAG TGGAGCATGA TGAAGATGCG ATTCGTATCG CGGATCATAT TATCGATATC GGCCCAGGCG CTGGGGTACA CGGCGGCGAA GTGATTTGCG ACGGCCCAAT TGAGAAAATT GTCGCCTGCG ACGAGTCCGT GACTGGCCAA TATATTTCGG GCAAACGCAA TATCCATATC 
AGCACTCCAC GTACGCCTTA CGATCCAAAA CAAGTGATTG AGCTTTACGG CGCCCGCGGC AATAACCTGC GCAATGTCGA CTTAACGGTT CCCGTAGGCC TGTTCACCTG CGTGACCGGG GTATCGGGTT CGGGTAAATC GACGCTGATT AACGACACCT TCTTTAGAAT TGCCCATAAG CAACTCAACG GTGCCACAGT CGATGAGCCT GCACCTTACG ATCGCATCGT CGGCATGGAG CAATGCGATA AAGTGGTCGA TATCGACCAA AGCCCAATTG GCCGCACGCC GCGCTCCAAC CCTGCCACTT ATACCGGTAT CTTTACGCCT ATTCGGGAGA TTTTTGCTGG AACGCAGGAG TCCCGTACCC GTGGTTATCA AGTGGGCCGT TTCTCCTTTA ACGTGAAAGG CGGTCGCTGT GAGGCCTGCC AAGGTGATGG CTTAATTAAG GTCGAAATGC ACTTCCTGCC CGATGTGTAT GTGCCTTGCG ATGCCTGTAA GGGTAAACGC TATAACCGCG AAACCTTAGA GGTGCGCTAC AAGGGTAAGA ACATCCACGA AGTGCTGCAA ATGACAGTGG AAGATGCGCG GGAGTTTTTC GATGCCGTGC CAGCCATTGC CCGTAAACTG CAAACCCTGA TGGATGTGGG TTTGTCCTAC GTACGTTTAG GCCAGAGCGC GACCACGCTG TCGGGCGGTG AAGCCCAAAG GGTGAAACTC GCGAAGGAAC TCTCTAAGCG CGATACGGGT AAGACCTTGT ATATTCTGGA TGAACCGACA ACGGGCTTGC ACTTTGCCGA TATCCAATTG CTGCTCGATG TGCTGCATCG CCTCAAATCC CATGGCAATA CTATCGTGGT GATTGAGCAT AATCTGGATG TGATTAAAAC CGCAGACTGG ATTATCGACT TAGGTCCAGA AGGCGGCGGT GGCGGCGGCA CTATCCTAGC AACTGGTACG CCAGAAGAGG TGGCAGAGCA TCCAACGTCC CATACCGCGC GCTTCCTAAA GCCGCTGCTG GAAAGGGATG CTAAGCTAGC CAAACAGTCA TAA
|
Protein sequence | MDKIEIRGAR THNLKNINLT IPRDKLIVIT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIYDYLRL LFARVGEPRC PTHGQPLAAQ TVSQMVDKVL EMPEDSRLML LAPVVNGRKG EHVKLLEGLS AQGYIRARID GEVCDLTDPP TLDLHVKHTI EVVVDRFKVR SDIQQRLAES FETALELSGG IAVVASMDEG NTEELIFSAN FACPHCGYSM AELEPRIFSF NNPAGACPTC DGLGVQQFFD PDRVITNPEL SLAGGAIRGW DRRNFYYFQM LSSLADHYKF DVEIPFEQLS DKVRKIVLYG SGKDSIAFKY INDRGDVVVR THPFEGILNN MDRRYRETES NAVREELAKF INTQACQSCG GSRLREEARN VFIGDLNLPK LTVWSIGEAL DYFDKLEFSG QKAQIAEKVL KEVRDRLGFL VNVGLNYLSL SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLQT LIHLRDLGNT VIVVEHDEDA IRIADHIIDI GPGAGVHGGE VICDGPIEKI VACDESVTGQ YISGKRNIHI STPRTPYDPK QVIELYGARG NNLRNVDLTV PVGLFTCVTG VSGSGKSTLI NDTFFRIAHK QLNGATVDEP APYDRIVGME QCDKVVDIDQ SPIGRTPRSN PATYTGIFTP IREIFAGTQE SRTRGYQVGR FSFNVKGGRC EACQGDGLIK VEMHFLPDVY VPCDACKGKR YNRETLEVRY KGKNIHEVLQ MTVEDAREFF DAVPAIARKL QTLMDVGLSY VRLGQSATTL SGGEAQRVKL AKELSKRDTG KTLYILDEPT TGLHFADIQL LLDVLHRLKS HGNTIVVIEH NLDVIKTADW IIDLGPEGGG GGGTILATGT PEEVAEHPTS HTARFLKPLL ERDAKLAKQS
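
The protein sequence can be related to the gene sequence by translating it codon by codon with translation table 11. The sketch below is a minimal pure-Python translator, not the method used to produce this record. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in the set of permitted start codons, which does not matter here because this gene starts with ATG). The function name `translate` and the two short excerpts are illustrative; the second excerpt is the last 24 bases of the gene, chosen so that it is in frame.

```python
# Codon table in the conventional TCAG ordering (amino acid assignments
# shared by the standard code and bacterial translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
first_blocks = "ATGGATAAGA TTGAAATACG CGGCGCACGG"
# Last 24 bases of the gene sequence (in frame, ending in the TAA stop codon).
last_bases = "GCTAAGCTAGCCAAACAGTCATAA"

print(translate(first_blocks))  # MDKIEIRGAR
print(translate(last_bases))    # AKLAKQS
```

The two outputs match the first ten and last seven residues of the protein sequence listed above. Passing the full gene sequence to the same function should yield the full 950-residue protein.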