Gene GSU3325 details

Gene Information

Locus tag: GSU3325
Symbol: uvrA
ID: 2686439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3652846
End bp: 3655659
Gene Length: 2814 bp
Protein Length: 937 aa
Translation table: 11
GC content: 63%
IMG OID: 637128019
Product: excinuclease ABC, A subunit
Protein accession: NP_954365
Protein GI: 39998414
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
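
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue to the protein. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3652846
end_bp = 3655659
gene_length_bp = 2814
protein_length_aa = 937

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp. Assumption: the last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three checks agree with the listed Gene Length and Protein Length values.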


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACACG ACACCATCAT CGTCAAAGGC GCGTGCGAAC ATAACCTCAA GTGCATCGAC 
GTGGAGATCC CTCGGGACAA GCTCGTCGTC ATCACCGGCA TTTCCGGGTC GGGAAAATCG
ACCCTGGCCT TCGATACCAT CTATGCCGAA GGGCAGCGCC GCTACGTGGA ATCCCTCTCG
GCCTATGCCC GCCAGTTTCT GGAGCAGATG GAGAAGCCCG ACGTGGAATC CATTGAGGGG
CTTTCCCCTG CCATCTCCAT CGAACAGAAG ACCACCAGCC GCAACCCCCG CTCCACCGTG
GGAACCGTCA CCGAGATCTA CGACTACCTT CGGCTCCTCT TCGCCCGCGT CGGCCACCCC
CACTGCTATG AATGCGGCAA ACCGATCACC TCCCAGACTG TCTCCCAGAT GGTGGACCAG
ATCATGGCCT TGCCCGCCGG AACCAGGCTT CAGCTTCTCT CGCCCATGGT CCGGGGTCGC
AAGGGGGAGT ACCGCAAGGA ACTGGCCCAG TTGCGCAAGG ACGGCTTCGC CCGGGTCATC
GTGGATGGCG TGCAGCATGA GCTGGCCGAG GAGATTCACC TCGACAAAAA CAAGAAACAC
GATATCGATA TCGTGGTGGA TCGACTCATT ATCAAGGAGG GAATCGAGCG GCGCCTGGCC
GACTCCCTGG AAACGGCCCT GAACCACGCG GAAGGGGTGG TAAAGGTCCA GGTCGTGGAC
GGCGACACCA TCCTTTTCTC CGAGGCACTG GCCTGCATCG ACTGCGGCAT CTCCTACCCC
GAGATGACCC CCCGGATGTT CTCATTCAAC AACCCCTACG GCGCCTGCCC CGACTGCACC
GGCCTCGGCA CGCGGATGTA CTTCGACGAG GAACTGGTCG TGCCGAACCC GGAACTCTCC
ATCCGCGAAG GAGCCATCGC CCCATGGGAG AAACGGCTCT CGGCCTGGTA CCACATGACT
CTGGACGCCC TGGCCAAGGC CTTTGACTTC GACATCCGGA CCCCCTTCAA AGAGCTCTCT
CCCCGGGTGC GCGAGGTGAT CCTGCGCGGA TCCAAAGGCG AAAAAGTTGA GTTCTGGTGG
GAAGAGGACG GTGGGCGGCG TCACACCTAC ACCAAGGAAT TCGAAGGGGT CATCCCCAAT
CTGGAGCGGC GCTACCGGGA AAGCGACTCG GAGCAGGTGC GGGAAGAGCT GGAGCGCTAC
ATGAACGTAA TGCCCTGCCC CACCTGCCAG GGGGCGCGCC TGAAGCGCGA GGCCCTTCAC
GTGAAGGTGG CGGAGCGGGA CATCCGTCAG GTAACCGCTC TCTCCATCAA GGACGCCTTG
GAGTTTTTCG CCTCCCTCAC GCTCACCCCG AAAGAGGAGG AGATCGCCCG CCGCATCCTG
AAAGAAATCA GGGAGCGGCT CCACTTCCTA GTTAACGTGG GACTCGACTA CCTGTCCCTG
GACCGGACCT CGGGCACCCT CTCCGGCGGT GAAGGGCAGC GGATCCGGCT CGCCACCCAG
ATCGGCTCCA GCCTCGTGGG GGTTCTCTAC ATCCTGGACG AGCCCTCCAT CGGCCTGCAC
CAGCGGGACA ACGGCCGACT GTTGCAGACC CTCAAGCACC TGCGCGACAT CGGCAACACG
GTGCTGGTGG TGGAGCACGA CGAGGAGACG ATCCTGGAGG CGGACCACGT GCTCGACATG
GGGCCCGGCG CCGGTGAGCA CGGGGGCCGC GTGGTGGCCC AGGGAACCCC GGCGGAGATC
ATGGCGAACC CCGAGTCCCT CACGGGCCGC TATCTCTCAG GAGAGCTCAC CATCGCCGTG
CCTAAAAAGC GGCGCAAGCC CAAGCGCTTC ATCACCGTGG AGGGAGCCGC GGAAAACAAC
CTGAAGGATG TCACCGTCGA CATCCCCCTC GGTGTCATGA CCTGCGTCAC CGGGGTGTCC
GGGTCGGGCA AATCGACACT CGTGATCGAC ACCCTCTACA AAGTCCTGGG CCAGCGGCTC
TACCGGAGCC GGGAGCGGGC CGGCGCAGTA CGCGACATCC GGGGACTGGA ACAACTGGAC
AAGGTCATCA ACATCGACCA GTCGCCCATC GGCCGCACGC CGCGCTCGAA CCCCGCCACC
TACACCGGGG TCTTCGCCGA TATCCGGGAT CTCTTCGCCC AACTCCCCGA ATCCAAGGTG
CGGGGCTATA AGCCGGGGCG CTACTCGTTC AACGTGAAGG GGGGACGGTG CGAGGCCTGC
GCCGGGGACG GGATCATCAA GATCGAGATG CACTTTCTCC CCGATGTCTA CGTCCAGTGT
GAGGTCTGCA AGGGAGCCCG CTATAACCGC GAGACCCTGG AGGTTACCTA CAAGGGGAAA
TCCATCGCGC AGGTCCTGGA CATGACCGTT TCGGAGGCCC TGCGCTTCCT GGAGAACATC
CCGAAGGTCA AGGCAAAGCT CCAGACCCTG GAGGAGGTGG GGCTCGGCTA CATCCGCCTG
GGCCAATCGG CCACGACCCT ATCCGGCGGC GAGGCCCAGC GGGTGAAACT GGCCAAGGAA
CTGGCGCGCC GGGCCACCGG CCGAACCATC TACATCCTCG ACGAGCCCAC CACCGGCCTT
CACTTCCACG ACATCGCCAA GCTCCTGGAG GTGCTAAGAA AACTGGTGGA GGGGGGAAAT
ACCATCGTCA TCATCGAGCA CAACCTGGAC GTCATCAAGA CCGCCGATTA CATCATCGAC
CTGGGCCCCG AAGGGGGCGA CCGGGGCGGC GAGGTGATCG CCACCGGCAC ACCTGAGGAG
GTGGCCAAGG TGACACGGTC CTACACGGGA CAGTATTTGC GGAAGATGCT GTGA
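
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. As a minimal sketch, the helper names `clean_sequence` and `gc_content` below are hypothetical; they show one way to strip the layout whitespace and compute the G+C fraction. Only the first line of the sequence is used here for brevity. Pasting the full sequence would be needed to compare against the GC content field listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in an already cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as printed (blocks of 10 bases).
first_line = "ATGGCACACG ACACCATCAT CGTCAAAGGC GCGTGCGAAC ATAACCTCAA GTGCATCGAC"

seq = clean_sequence(first_line)
print(len(seq), "bases")
print(f"GC fraction of this fragment: {gc_content(seq):.3f}")
```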
 
Protein sequence
MAHDTIIVKG ACEHNLKCID VEIPRDKLVV ITGISGSGKS TLAFDTIYAE GQRRYVESLS 
AYARQFLEQM EKPDVESIEG LSPAISIEQK TTSRNPRSTV GTVTEIYDYL RLLFARVGHP
HCYECGKPIT SQTVSQMVDQ IMALPAGTRL QLLSPMVRGR KGEYRKELAQ LRKDGFARVI
VDGVQHELAE EIHLDKNKKH DIDIVVDRLI IKEGIERRLA DSLETALNHA EGVVKVQVVD
GDTILFSEAL ACIDCGISYP EMTPRMFSFN NPYGACPDCT GLGTRMYFDE ELVVPNPELS
IREGAIAPWE KRLSAWYHMT LDALAKAFDF DIRTPFKELS PRVREVILRG SKGEKVEFWW
EEDGGRRHTY TKEFEGVIPN LERRYRESDS EQVREELERY MNVMPCPTCQ GARLKREALH
VKVAERDIRQ VTALSIKDAL EFFASLTLTP KEEEIARRIL KEIRERLHFL VNVGLDYLSL
DRTSGTLSGG EGQRIRLATQ IGSSLVGVLY ILDEPSIGLH QRDNGRLLQT LKHLRDIGNT
VLVVEHDEET ILEADHVLDM GPGAGEHGGR VVAQGTPAEI MANPESLTGR YLSGELTIAV
PKKRRKPKRF ITVEGAAENN LKDVTVDIPL GVMTCVTGVS GSGKSTLVID TLYKVLGQRL
YRSRERAGAV RDIRGLEQLD KVINIDQSPI GRTPRSNPAT YTGVFADIRD LFAQLPESKV
RGYKPGRYSF NVKGGRCEAC AGDGIIKIEM HFLPDVYVQC EVCKGARYNR ETLEVTYKGK
SIAQVLDMTV SEALRFLENI PKVKAKLQTL EEVGLGYIRL GQSATTLSGG EAQRVKLAKE
LARRATGRTI YILDEPTTGL HFHDIAKLLE VLRKLVEGGN TIVIIEHNLD VIKTADYIID
LGPEGGDRGG EVIATGTPEE VAKVTRSYTG QYLRKML
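
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is an illustration, not the database's own pipeline: it builds a codon map from the compact amino-acid string used for NCBI translation table 11 (which assigns the same amino acids as the standard code), and the function name `translate` is hypothetical. It translates only the first line of the gene sequence, which should reproduce the first 20 residues of the protein sequence.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
# Amino-acid assignments for translation table 11 ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGCACACG ACACCATCAT CGTCAAAGGC GCGTGCGAAC ATAACCTCAA GTGCATCGAC"
print(translate(first_line))  # prints MAHDTIIVKGACEHNLKCID
```

The printed residues match the first two blocks of the protein sequence (MAHDTIIVKG ACEHNLKCID).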