Gene P9303_30191 details


Gene Information

Locus tag: P9303_30191
Symbol: uvrA
ID: 4776864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2674059
End bp: 2677034
Gene Length: 2976 bp
Protein Length: 991 aa
Translation table: 11
GC content: 49%
IMG OID: 640088543
Product: excinuclease ABC subunit A
Protein accession: YP_001019014
Protein GI: 124024707
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
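
The start, end, gene length and protein length values in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2674059, 2677034)
print(length)                  # 2976
print(protein_length(length))  # 991
```

Both results agree with the Gene Length (2976 bp) and Protein Length (991 aa) fields of the record.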


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.213667
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCGAG ATGCCGCCAA GGTCAAGGAT CATCGTTTGA GTAAGCAGCT GAATCTGAGT 
GGTGGCTCAT TGGAGGATGT GATTCGCGTG CGAGGTGCGC GTCAGCACAA CCTCAAGAAC
GTTGATATCA CCCTTCCACG CAACAAGCTG GTGGTTTTAA CCGGAGTAAG TGGAAGCGGA
AAGAGCTCTT TAGCGTTTGA CACGATTTTT GCTGAGGGTC AGCGTCGCTA CGTCGAGAGT
CTTTCCGCTT ATGCCCGTCA GTTTTTGGGT CAGGTTGATA AGCCTGATGT CGATGCTATC
GAGGGGTTGT CACCGGCGAT TTCCATTGAT CAGAAGTCGA CGAGTCACAA TCCTCGTTCA
ACCGTTGGAA CGGTCACGGA AATTCAGGAT TATTTGCGCT TGCTATTCGG ACGTGCTGGA
GAACCGCACT GTCCGCAATG TGATCGGCTA ATCCGTCCTC AGACCATCGA TGAGATGGTG
GATCAGATCC TAATTTTGCC TGAAGGAACC CGTTATCAAT TGTTGGCACC GTTGGTTCGT
GGAAAGAAGG GTACTCACGC CAAATTGTTG AGTGGCCTTG CGGCAGAGGG ATTTGCGAGG
GTTCGGATTA ATGCAGAGGT TAGAGAGCTT GCCGACAACA TTGAATTGGA CAAGAACCAT
CTGCACAGTA TTGAGGTGGT GGTTGATCGT CTGGTGGCTC GGGAAGGGAT TCAGGAACGA
TTAACTGATT CATTGCGTAC CACTTTGATG CGCGGTGATG GCCTTGCTTT GGTGGAGGTT
GTACCTAAAG CGGATCAAGC TCTCCCCGAG GGTGTGGAGC GTGAGCGGCT CTATTCCGAG
AATTTTGCAT GTCCAGTGCA TGGGGCGGTG ATTGAGGAAC TTTCTCCCAG GCTATTTTCG
TTTAATAGCC CTTATGGTGC TTGTTCTGAT TGTCATGGCA TCGGCCATCT TCGTAAGTTC
ACTTTAGAAC GGGTTGTGCC TGATCCCTCC TTGCCTGTTT ATGCCGCTGT GGCGCCTTGG
AGTGATAAGG ACAACAGCTA TTACTTTTCA TTGCTGTATT CAGTTGGTGA GGCCTTTGGT
TTTGAAATTA AAACGCCATG GAAGAACTTA ACGGCAGATC AGCAACATGT ATTGCTTTAT
GGAAGTTCTG AGCCGATTCA GATTCAGGCT GATAGCCGCT ATAAAAAGAG TACAGGTTAT
ATGAGACCTT TTGAGGGCAT TTTGCCCATC TTGGAAAGAC AATTGCGTGA TGCAAGTGGC
GAGGCTGTTA AACAGAAACT TGAGAAGTTT CTTGAGTTGG TCCCTTGTGG AAGTTGTGGT
GGGAAGAGAT TGCGTGCAGA AGCTCTGGCG GTGAAGGTGG GTCCTTATCG GATTACTGAG
CTCACTTCTA TCAGCGTGGC GCTGACTCTT GAACGCATCG AAAAGCTGAT GGGCGTTGGA
GCAGCCAATG GATCTGAGCC TTTGCTCAAT TCACGTCAGA TCCAGATCGG TGATTTGGTT
TTGCGGGAGA TTCGTATGCG TCTTCGCTTT CTTCTCGATG TTGGGCTCGA ATATCTCAGC
TTGGATCGAC CCGCGATGAC CTTGTCTGGC GGTGAGGCTC AGCGGATTCG TTTGGCTACA
CAGATCGGCG CTGGTCTTAC GGGTGTGCTT TATGTGCTTG ATGAACCAAG CATTGGTTTG
CATCAGCGGG ACAATGATCG TCTGTTAGCC ACGCTCAGAC GTTTAAGGGA TCTGGGTAAC
ACTTTAATAG TGGTAGAACA CGATGAGGAC ACCATTCGCG CCGCTGACCA TCTTGTCGAT
ATTGGTCCAG GGGCAGGGGT GCATGGGGGC CATATTGTTG TTGAGGGATC TTTGGATCAT
CTGTTGATGG CTGAAGAATC TTTGACAGGT GCATATCTCA GTGGCCGCCG CTCTATTCCT
ACACCTAGAG AGAGGAGAGA AGGAAGCAGT CGCAGGCTTC GTTTGATTGA TTGTGATCGC
AATAATCTTA AAAATCTCAC CGTAGATTTT CCTTTAGGGC GCCTCGTTGC GGTCACAGGA
GTCAGTGGTA GTGGTAAGAG CACTTTGGTG AATGAGTTGC TTCATCCTGC CTTAGATCAC
AGTTTGGGTT TGAAGGTGCC TTTCCCTCAA GGCTTGGGCG AGTTGCGAGG TGTCAATGCG
ATCGACAAGG TGATCGTGAT TGACCAGTCT CCAATCGGCC GTACACCGCG TTCTAATCCA
GCCACCTACA CCGGTGCGTT TGATCCGATT CGTCAGGTAT TTGCTGCTTC CGTTGAGGCA
AAGGCTCGTG GTTATCAGGT GGGTCAGTTT AGTTTCAATG TGAAAGGTGG TCGCTGTGAG
GCTTGTCGTG GTCAAGGGGT GAATGTGATT GAAATGAACT TTTTGCCAGA TGTTTATGTT
CAGTGCGATG TCTGCAAAGG TGCTCGATTT AATCGGGAGA CCTTACAGGT CACTTACAAG
GGACACACAA TTGCAGATGT TTTGCAGATG ACGGTTGAGC AGGCTGCTGA GGTTTTTTCT
GCTATTCCCC AGGCGGCTGA TCGATTGCGC ACGTTGGTTG ATGTGGGACT TGGATATGTG
AAGTTAGGTC AGCCTGCACC CACGCTTTCA GGTGGTGAAG CTCAACGTGT GAAGCTTGCT
ACCGAGCTCT CCAAGCGGGC TACTGGCAAA ACCCTTTATT TGATTGATGA ACCAACCACT
GGCCTCAGTT TTTACGACGT GCATAAGTTG ATGGATGTCA TGCAGCGTCT TGTTGATAAA
GGCAATTCAA TTATCGTGAT AGAACACAAC TTGGATGTAA TTCGTTGTTC TGATTGGATT
ATTGATCTTG GCCCTGAAGG AGGAGATTGT GGTGGCGATC TTCTGGTCAC GGGGACTCCA
GAAGAGGTTG CATCCCATCC CACTAGCCAT ACGGGGCATT ATCTCAAGAA GGTGTTAGCG
AAGCATCCTC CTGAGACTGT GTCTGTTGCA GCTTGA
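
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before any computation. As a minimal sketch, the helpers below strip whitespace and compute the fraction of G and C bases, which is how a GC content figure such as the one in the Gene Information section is usually defined (an assumption; the record does not state its method). The function names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C, counted over the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Small illustrative input, not the full gene:
print(gc_fraction("ATGC GCAT"))  # 0.5
```

Applying `gc_fraction` to the full gene sequence and multiplying by 100 gives a percentage that can be compared with the reported GC content.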
 
Protein sequence
MGRDAAKVKD HRLSKQLNLS GGSLEDVIRV RGARQHNLKN VDITLPRNKL VVLTGVSGSG 
KSSLAFDTIF AEGQRRYVES LSAYARQFLG QVDKPDVDAI EGLSPAISID QKSTSHNPRS
TVGTVTEIQD YLRLLFGRAG EPHCPQCDRL IRPQTIDEMV DQILILPEGT RYQLLAPLVR
GKKGTHAKLL SGLAAEGFAR VRINAEVREL ADNIELDKNH LHSIEVVVDR LVAREGIQER
LTDSLRTTLM RGDGLALVEV VPKADQALPE GVERERLYSE NFACPVHGAV IEELSPRLFS
FNSPYGACSD CHGIGHLRKF TLERVVPDPS LPVYAAVAPW SDKDNSYYFS LLYSVGEAFG
FEIKTPWKNL TADQQHVLLY GSSEPIQIQA DSRYKKSTGY MRPFEGILPI LERQLRDASG
EAVKQKLEKF LELVPCGSCG GKRLRAEALA VKVGPYRITE LTSISVALTL ERIEKLMGVG
AANGSEPLLN SRQIQIGDLV LREIRMRLRF LLDVGLEYLS LDRPAMTLSG GEAQRIRLAT
QIGAGLTGVL YVLDEPSIGL HQRDNDRLLA TLRRLRDLGN TLIVVEHDED TIRAADHLVD
IGPGAGVHGG HIVVEGSLDH LLMAEESLTG AYLSGRRSIP TPRERREGSS RRLRLIDCDR
NNLKNLTVDF PLGRLVAVTG VSGSGKSTLV NELLHPALDH SLGLKVPFPQ GLGELRGVNA
IDKVIVIDQS PIGRTPRSNP ATYTGAFDPI RQVFAASVEA KARGYQVGQF SFNVKGGRCE
ACRGQGVNVI EMNFLPDVYV QCDVCKGARF NRETLQVTYK GHTIADVLQM TVEQAAEVFS
AIPQAADRLR TLVDVGLGYV KLGQPAPTLS GGEAQRVKLA TELSKRATGK TLYLIDEPTT
GLSFYDVHKL MDVMQRLVDK GNSIIVIEHN LDVIRCSDWI IDLGPEGGDC GGDLLVTGTP
EEVASHPTSH TGHYLKKVLA KHPPETVSVA A
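
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon map from the NCBI base order TCAG; translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which this sketch does not model. The `translate` function is illustrative and assumes an in-frame sequence containing only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; same assignments as table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence:
print(translate("ATGGGGCGAG ATGCCGCCAA GGTCAAGGAT"))  # MGRDAAKVKD
```

The output matches the first ten residues of the protein sequence listed above.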