Gene Shewmr4_3421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3421 
Symbol 
ID4253987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp4090039 
End bp4092891 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content53% 
IMG OID638120059 
Productexcinuclease ABC subunit A 
Protein accessionYP_735544 
Protein GI113971751 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00603968 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGA TTGAAATACG CGGCGCACGG ACCCACAATC TCAAAAATAT CAACCTGACT 
ATCCCAAGGG ATAAGTTAAT CGTTATCACA GGGTTATCGG GTTCAGGCAA ATCTTCCCTA
GCATTTGATA CCTTATATGC CGAAGGCCAA CGACGTTACG TTGAGTCTCT TTCTGCCTAT
GCCCGCCAAT TCTTAAGCTT GATGGAAAAG CCCGATGTCG ACCATATCGA AGGCTTGAGC
CCGGCAATTT CCATCGAACA AAAATCGACG TCCCATAACC CGCGCTCCAC CGTCGGGACT
ATTACCGAAA TTTACGACTA TCTGCGTCTG TTGTTTGCCC GTGTGGGCGA ACCGCGCTGC
CCGACACACG GCCAACCCCT TGCGGCGCAA ACCGTGAGTC AGATGGTCGA TAAAGTGTTA
GAAATGCCCG AAGACAGCCG CTTGATGCTG CTGGCGCCCG TAGTCAACGG TCGTAAGGGT
GAGCACGTTA AACTACTCGA AGGTTTATCG GCGCAGGGCT ATATCCGCGC CCGAATCGAT
GGCGAAGTCT GCGACTTAAC CGATCCACCA ACACTCGATT TACACGTTAA GCACACCATC
GAAGTGGTGG TCGACCGCTT TAAAGTCCGC AGCGATATCC AGCAGCGTCT CGCCGAATCC
TTTGAAACCG CGCTCGAACT CTCCGGCGGG ATTGCCGTGG TCGCCAGCAT GGATGAAGGC
AACACTGAAG AATTAATCTT CTCTGCAAAT TTTGCCTGTC CGCACTGCGG TTATTCAATG
GCAGAGCTTG AGCCACGGAT TTTCTCCTTT AACAACCCGG CGGGCGCTTG CCCGACCTGT
GATGGTTTAG GGGTACAACA GTTTTTCGAC CCCGACAGAG TGATCACTAA TCCTGAGTTA
TCCCTTGCGG GCGGCGCGAT TCGTGGCTGG GATAGACGTA ACTTCTATTA CTTTCAGATG
CTGAGTTCGC TCGCCGACCA CTATAAGTTC GATGTCGAAA TTCCCTTCGA ACAGCTGTCG
GATAAGGTGA GAAAAATCGT CCTCTACGGC TCAGGCAAAG ACAGCATCGC CTTCAAATAC
ATCAACGACC GTGGTGATGT GGTGGTGCGC ACTCACCCCT TCGAAGGCAT CTTAAATAAT
ATGGACCGGC GCTACCGCGA GACCGAAAGC AACGCGGTGC GCGAAGAATT AGCCAAGTTT
ATCAATACTC AAGCCTGCCA AAGCTGTGGC GGGTCGCGCT TGCGCGAAGA AGCCCGTAAC
GTGTTTATTG GCGATCTTAA CCTGCCAAAA CTCACCGTTT GGTCGATAGG TGAAGCCCTA
GACTACTTCG ACAAACTCGA GTTCAGCGGC CAAAAGGCAC AGATCGCCGA GAAGGTACTG
AAGGAAGTGC GCGATCGCTT AGGCTTCCTA GTTAACGTCG GCCTCAACTA TTTAAGTCTG
TCGCGTTCGG CCGAAACCCT CTCAGGCGGT GAAGCGCAGC GTATTCGTCT CGCCAGCCAA
ATCGGCGCAG GACTCGTCGG CGTGATGTAT GTGCTTGACG AGCCTTCTAT CGGCTTGCAC
CAGCGAGATA ACGAACGGTT ATTGCAGACC TTAATCCACC TGCGGGATTT AGGTAATACA
GTGATCGTAG TGGAGCATGA TGAAGATGCG ATTCGTATCG CGGATCATAT TATCGATATC
GGCCCAGGCG CTGGGGTACA CGGCGGCGAA GTGATTTGCG ACGGCCCAAT TGAGAAAATT
GTCGCCTGCG ACGAGTCCGT GACTGGCCAA TATATTTCGG GCAAACGCAA TATCCATATC
AGCACTCCAC GTACGCCTTA CGATCCAAAA CAAGTGATTG AGCTTTACGG CGCCCGCGGC
AATAACCTGC GCAATGTCGA CTTAACGGTT CCCGTAGGCC TGTTCACCTG CGTGACCGGG
GTATCGGGTT CGGGTAAATC GACGCTGATT AACGACACCT TCTTTAGAAT TGCCCATAAG
CAACTCAACG GTGCCACAGT CGATGAGCCT GCACCTTACG ATCGCATCGT CGGCATGGAG
CAATGCGATA AAGTGGTCGA TATCGACCAA AGCCCAATTG GCCGCACGCC GCGCTCCAAC
CCTGCCACTT ATACCGGTAT CTTTACGCCT ATTCGGGAGA TTTTTGCTGG AACGCAGGAG
TCCCGTACCC GTGGTTATCA AGTGGGCCGT TTCTCCTTTA ACGTGAAAGG CGGTCGCTGT
GAGGCCTGCC AAGGTGATGG CTTAATTAAG GTCGAAATGC ACTTCCTGCC CGATGTGTAT
GTGCCTTGCG ATGCCTGTAA GGGTAAACGC TATAACCGCG AAACCTTAGA GGTGCGCTAC
AAGGGTAAGA ACATCCACGA AGTGCTGCAA ATGACAGTGG AAGATGCGCG GGAGTTTTTC
GATGCCGTGC CAGCCATTGC CCGTAAACTG CAAACCCTGA TGGATGTGGG TTTGTCCTAC
GTACGTTTAG GCCAGAGCGC GACCACGCTG TCGGGCGGTG AAGCCCAAAG GGTGAAACTC
GCGAAGGAAC TCTCTAAGCG CGATACGGGT AAGACCTTGT ATATTCTGGA TGAACCGACA
ACGGGCTTGC ACTTTGCCGA TATCCAATTG CTGCTCGATG TGCTGCATCG CCTCAAATCC
CATGGCAATA CTATCGTGGT GATTGAGCAT AATCTGGATG TGATTAAAAC CGCAGACTGG
ATTATCGACT TAGGTCCAGA AGGCGGCGGT GGCGGCGGCA CTATCCTAGC AACTGGTACG
CCAGAAGAGG TGGCAGAGCA TCCAACGTCC CATACCGCGC GCTTCCTAAA GCCGCTGCTG
GAAAGGGATG CTAAGCTAGC CAAACAGTCA TAA
 
Protein sequence
MDKIEIRGAR THNLKNINLT IPRDKLIVIT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIYDYLRL LFARVGEPRC
PTHGQPLAAQ TVSQMVDKVL EMPEDSRLML LAPVVNGRKG EHVKLLEGLS AQGYIRARID
GEVCDLTDPP TLDLHVKHTI EVVVDRFKVR SDIQQRLAES FETALELSGG IAVVASMDEG
NTEELIFSAN FACPHCGYSM AELEPRIFSF NNPAGACPTC DGLGVQQFFD PDRVITNPEL
SLAGGAIRGW DRRNFYYFQM LSSLADHYKF DVEIPFEQLS DKVRKIVLYG SGKDSIAFKY
INDRGDVVVR THPFEGILNN MDRRYRETES NAVREELAKF INTQACQSCG GSRLREEARN
VFIGDLNLPK LTVWSIGEAL DYFDKLEFSG QKAQIAEKVL KEVRDRLGFL VNVGLNYLSL
SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLQT LIHLRDLGNT
VIVVEHDEDA IRIADHIIDI GPGAGVHGGE VICDGPIEKI VACDESVTGQ YISGKRNIHI
STPRTPYDPK QVIELYGARG NNLRNVDLTV PVGLFTCVTG VSGSGKSTLI NDTFFRIAHK
QLNGATVDEP APYDRIVGME QCDKVVDIDQ SPIGRTPRSN PATYTGIFTP IREIFAGTQE
SRTRGYQVGR FSFNVKGGRC EACQGDGLIK VEMHFLPDVY VPCDACKGKR YNRETLEVRY
KGKNIHEVLQ MTVEDAREFF DAVPAIARKL QTLMDVGLSY VRLGQSATTL SGGEAQRVKL
AKELSKRDTG KTLYILDEPT TGLHFADIQL LLDVLHRLKS HGNTIVVIEH NLDVIKTADW
IIDLGPEGGG GGGTILATGT PEEVAEHPTS HTARFLKPLL ERDAKLAKQS