Gene P9301_19021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_19021 
SymboluvrA 
ID4912449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1633466 
End bp1636369 
Gene Length2904 bp 
Protein Length967 aa 
Translation table11 
GC content34% 
IMG OID640161508 
Productexcinuclease ABC subunit A 
Protein accessionYP_001092126 
Protein GI126697240 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAATA AAGTTGATAG TAGTTTTGAA GAAGATAATT CAATTAATAT TAGAGGTGCT 
CGTCAGCATA ATTTAAAAAA TATTGATCTT TCTCTACCTA GGAATAAATT TATAGTTTTT
ACGGGTGTTA GTGGAAGTGG TAAAAGTTCT TTAGCCTTTG ATACTATTTT TGCTGAAGGT
CAGAGAAGAT ATGTTGAGAG TCTTTCTGCA TATGCAAGAC AATTTTTGGG TCAAGTAGAT
AAACCAGATG TTGACAATAT TGAGGGTTTA TCACCTGCCA TTTCAATTGA TCAAAAATCT
ACAAGTCATA ATCCTCGATC AACAGTTGGA ACAGTAACAG AGATACAAGA TTATTTAAGA
TTATTGTTTG GTCGTGCTGG TGAGCCGCAT TGTCACCACT GCGGGATTCC AATTTCGCCT
CAAACAATTG ATGAAATGGT GGATCAAATT CTTCTTTTGC CAGAAGGAAC AAGGTACCAA
TTGTTGGCTC CTGTTGTAAG AGGTAAGAAA GGAACACATA CAAAATTAAT AAGTGGACTA
GCTGCTGAAG GATTTGCCAG AGTTAGAATC AATGGGGAAG TAAGAGAACT TGCTGATAGT
ATTGAATTAG ATAAAAATCA AATTCATAAT ATTGAGGTAG TAGTTGATAG ATTAATTGCA
AGAGAAGGAA TACAAGAAAG ATTAAATGAT TCTCTTCAAA CTTGTCTCAA AAGAGGAGAT
GGCTTAGCAA TAGTAGAAGT TGTCCCAAAA AAAGGAGAAA ATTTACCTTC TAACTTAGAG
AGAGAGAAAC TTTACTCAGA AAATTATGCA TGTCCTGTAC ATGGCTCTAT TGTTGAAGAA
CTTTCTCCTA GATTATTTTC TTTTAATAGC CCATATGGTG CTTGTCCAGA TTGTCATGGG
ATTGGTTATT TAAAAAAATT TACTGCGGAT AGAGTTATAC CTGATAAAAC ATTGCCTGTT
TATGCTGCAA TAGCTCCTTG GAGTGAGAAA GATAATACTT ATTATTTCTC ATTACTTTAT
TCTGTAGGAC AAGCTTATGG TTTTGAATTA AAAACTCCTT GGAAAGATTT AAGTGATTTG
CAAAAACAAG TTCTACTTTT GGGATCAGAT AAACCAATAT TAATTCAAGC TGATAGTCGT
TTTAAAACTT CTAGTGGTTT TGAAAGACCT TTTGAAGGAA TTTTACCAAT TTTAGAAAGG
CAATTCAATG AAGCCAATGG TGAATCAGTT AAACAAAAAT TAGAAAAGTA TCTAGAATTA
GTTCCCTGTA AGACATGTTC TGGAAAAAGA TTAAGACCTG AGGCTTTGGC CGTTAAAATT
GGTCCATATA ATATTACTGA CTTAACTTCT ATAAGTGTTT CTGAAACCCT AAATCACATA
GAATTCCTCA TGGGTTTGAG TAATACAAAG AAAAAAAATA TATCTTTATC AGAAAAACAA
AAGCAGATAG GTGAATTGGT TTTAAAAGAG ATTCGTTTAC GTTTGAAGTT TTTAATTAAT
GTAGGCTTAG ATTATTTAAC TTTAGATAGA CCAGCTATGA CTTTGTCTGG TGGTGAGGCT
CAGCGTATTA GATTGGCTAC ACAAATAGGA GCAGGTCTTA CTGGGGTTTT GTATGTATTA
GATGAACCAA GTATAGGTTT GCATCAGAGA GACAATGACA GATTATTAGA AACATTAAAA
AGCTTAAGAG ACTTGGGAAA TACTTTGGTT GTTGTTGAAC ATGATGAAGA TACTATGAAA
TCCGCAGATT ATTTAGTAGA TATTGGTCCA GGGGCAGGTG TTTATGGTGG GGAAATTATT
GCTAAGGGAT CTTATCAAGA TGTCTTAAAT TCAGAAAAGT CATTAACTGG AGCTTATCTC
AGTGGTAGAA AGTCGATTCC TACTCCAAAA GAACGTAGAT CATCTGTAAA AAAAAGTTTA
ATTTTAAATA ATTGCTCTAA AAATAATTTA AAAAATATTT CTGTTGAATT TCCTTTAGGA
AGGTTAGTTT CTATTACTGG CGTGAGTGGA AGTGGGAAGA GTACTTTGAT AAATGAATTA
CTTCATCCTG CATTATGTCA TTCTTTAGGA TTAAAAGTCC CTTTTCCTCA AGGCGTAAAA
GAGTTAAAGG GTATAAAGGC AATTGATAAA GTTATCGTTA TTGATCAATC TCCAATAGGA
AGAACTCCAA GATCAAATCC TGCTACATAT ACTGGTGCTT TTGATCCTAT AAGGCAGATA
TTTACTGCTA CAGTTGAAGC AAAAGCAAGA GGCTATCAGG CTGGTCAATT TAGCTTTAAT
GTGAAAGGAG GAAGATGCGA AGCTTGTAAG GGTCAGGGAG TAAATGTAAT TGAAATGAAC
TTTTTACCTG ATGTGTATGT TCAATGTGAA GTATGTAAAG GAGCTCGTTT TAATAGGGAA
ACTCTTCAGG TGAAATATAA AGGTTTCAAT ATATCTGATG TCTTAGAGAT GACTGTTGAA
CAAGCTGCAG AAACATTCTC TGCAATACCT CAAGCTGCTG ATAGATTATC TACATTGGTA
GATGTCGGTT TAGGATATGT CAAATTAGGT CAACCAGCTC CTACATTATC CGGTGGAGAG
GCTCAAAGAG TTAAGTTAGC TACGGAATTG TCCAAAAGGG CTACTGGAAA AACTTTATAT
TTGATTGATG AACCAACTAC AGGATTAAGT TTTTATGATG TTCATAAATT AATGGATGTG
ATACAACGTT TGGTAGATAA AGGTAATTCA GTAATTGTTA TTGAACATAA TTTAGATGTT
ATTAGATGTT CAGATTGGAT TATCGATTTA GGTCCTGATG GAGGGGATAA AGGAGGAGAA
ATCATTGCAG AAGGTATTCC TGAGGATGTA GCTAAAAATC CTAGAAGTCA TACAGCAAAA
TATCTTAAAA AGGTCTTAAA TTAA
 
Protein sequence
MVNKVDSSFE EDNSINIRGA RQHNLKNIDL SLPRNKFIVF TGVSGSGKSS LAFDTIFAEG 
QRRYVESLSA YARQFLGQVD KPDVDNIEGL SPAISIDQKS TSHNPRSTVG TVTEIQDYLR
LLFGRAGEPH CHHCGIPISP QTIDEMVDQI LLLPEGTRYQ LLAPVVRGKK GTHTKLISGL
AAEGFARVRI NGEVRELADS IELDKNQIHN IEVVVDRLIA REGIQERLND SLQTCLKRGD
GLAIVEVVPK KGENLPSNLE REKLYSENYA CPVHGSIVEE LSPRLFSFNS PYGACPDCHG
IGYLKKFTAD RVIPDKTLPV YAAIAPWSEK DNTYYFSLLY SVGQAYGFEL KTPWKDLSDL
QKQVLLLGSD KPILIQADSR FKTSSGFERP FEGILPILER QFNEANGESV KQKLEKYLEL
VPCKTCSGKR LRPEALAVKI GPYNITDLTS ISVSETLNHI EFLMGLSNTK KKNISLSEKQ
KQIGELVLKE IRLRLKFLIN VGLDYLTLDR PAMTLSGGEA QRIRLATQIG AGLTGVLYVL
DEPSIGLHQR DNDRLLETLK SLRDLGNTLV VVEHDEDTMK SADYLVDIGP GAGVYGGEII
AKGSYQDVLN SEKSLTGAYL SGRKSIPTPK ERRSSVKKSL ILNNCSKNNL KNISVEFPLG
RLVSITGVSG SGKSTLINEL LHPALCHSLG LKVPFPQGVK ELKGIKAIDK VIVIDQSPIG
RTPRSNPATY TGAFDPIRQI FTATVEAKAR GYQAGQFSFN VKGGRCEACK GQGVNVIEMN
FLPDVYVQCE VCKGARFNRE TLQVKYKGFN ISDVLEMTVE QAAETFSAIP QAADRLSTLV
DVGLGYVKLG QPAPTLSGGE AQRVKLATEL SKRATGKTLY LIDEPTTGLS FYDVHKLMDV
IQRLVDKGNS VIVIEHNLDV IRCSDWIIDL GPDGGDKGGE IIAEGIPEDV AKNPRSHTAK
YLKKVLN