Gene P9515_19031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_19031 
SymboluvrA 
ID4720137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp1695765 
End bp1698665 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content34% 
IMG OID640081604 
Productexcinuclease ABC subunit A 
Protein accessionYP_001012217 
Protein GI123967136 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAAAA AAAATATTGG TATCAATGAA GATAATTCCA TAAAAATCCG AGGTGCTCGA 
CAACATAATT TAAAAAATAT AGATTTAACT CTTCCAAGAA ATAAATTTAT TGTTTTTACA
GGTGTTAGCG GTAGTGGTAA AAGCTCTTTA GCTTTTGATA CTATTTTTGC AGAAGGTCAA
AGAAGATATG TAGAAAGCCT TTCTGCTTAT GCCAGACAGT TTTTAGGTCA AGTTGACAAA
CCAGATGTAG ATAATATTGA GGGATTATCC CCTGCTATAT CTATAGATCA AAAATCAACG
AGTCATAATC CTCGATCAAC TGTTGGAACA GTTACTGAGA TACAAGATTA TTTAAGATTA
TTATTTGGAC GCGCTGGAGA ACCTCATTGT CATCATTGTG GGTTGCCAAT TGCTCCGCAA
ACAATTGATG AAATGGTTGA TCAAATAGTT TTGTTGCCTG AAGGAACAAG GTATCAATTG
CTTGCACCTG TCGTAAGAGG TAAAAAGGGA ACTCATGCAA AATTACTTAG CGGATTAGCC
GCAGAAGGTT TTGCTCGAGT AAGAATTAAT GGTGAGGTTA GAGAGCTTGC AGATAGTATT
GAGTTAGATA AGAACCATAT TCATAATATC GAGGTTGTAG TAGACCGATT GATTGCTAGA
GATGGAATTC AGGAAAGATT GAATGACTCT TTACAAACTT GCCTTAAAAG AGGAGATGGG
CTTTCTATTG TTGAAGTTGT TCCTAAAAAA GGGGAAACTT TACCTCCCGA GTTAGATAAA
GAAAAATTAT ATTCTGAGAA CTATGCATGT CCAATTCATG GATCTATTGT TGAAGAATTG
TCTCCAAGAT TATTTTCTTT TAATAGCCCT TATGGTGCTT GTTCCGATTG TCATGGTATT
GGCTATTTAA AAAAATTTAC TGCAGATAGA GTAATACCTG ATACTTCTTT ACCTGTTTAC
GCTGCCATAG CTCCTTGGAG CGAAAAAGAT AATACATATT ATTTTTCGTT ACTTTATTCA
GTAGGACAAG CTTATGGTTT TGAATTAAAA ACTCCTTGGA AAGATCTCAG TGATTTACAA
AAAAATGTCT TACTTTCGGG ATCTGATAAG CCAATTTTAA TTCAAGCTGA TAGTCGCTTT
AAAACTTCTA GCGGTTTTGA AAGACCTTTT GAGGGAATTT TGCCAATCCT AGAAAGGCAA
TTAAGTGAAT CTAATGGAGA ATCAGTTAAA CAAAAGCTTG AAAAATATTT AGAATTAGTT
CCTTGTAAAA CATGTTCAGG TAAGAGATTA AAACCAGAAG CACTCGCTGT TAAACTTGGC
CCCTATAACA TCACTGATTT AACTTCTATT AGCGTCTCAG AAACTTTATC TCATATTGAG
AAAATAATGG GTTTAAGTAA AAATAAAACA GAAATTGTTC TCTTGTCATC TAAACAAAAA
CAAATAGGTG AATTAGTTTT AAAAGAAATT CGATTACGTT TAAAGTTTCT TATTAATGTA
GGTCTGGATT ATTTGACACT AGATAGACCA GCCATGACTT TGTCTGGTGG AGAAGCTCAG
CGTATTAGAT TAGCAACTCA AATAGGAGCA GGTTTAACTG GTGTTTTATA TGTATTGGAT
GAACCAAGTA TCGGCTTACA TCAAAGAGAT AATGATAGGT TGCTTGAAAC ATTAAAAAAT
CTCAGAGATT TAGGTAATAC TCTTGTTGTT GTAGAACATG ATGAAGATAC TATGAAATCT
GCAGATTACT TAGTGGATAT AGGGCCCGGT GCAGGCGTTT ATGGAGGTGA AATTATTGCA
AAAGGTACAT TTGAGGATGT TTTAAACTCA GAAAAGTCTT TGACTGGCGC TTATCTCAGT
GGGAGGAAGT CTATTCCTAC ACCAAGTGTT AGAAGATCTT CAGTTAAGAA AAGTCTTATA
CTAAATAATT GTGTTAAGAA TAATCTCAAG GATATATCAG TTGAATTTCC TTTGGGAAGA
TTAGTTTCTG TAACTGGTGT AAGTGGTAGT GGGAAAAGTA CATTAGTTAA TGAATTATTA
CATCCTGCAC TTTCACACTC TCTTGGTTTA AAAGTTCCTT TTCCTAAGGG TGTTAAAGAA
TTAAAAGGTA TAAAAGCTAT TGATAAAGTA ATTGTTATTG ATCAATCTCC TATTGGAAGA
ACTCCTAGAT CAAACCCTGC TACTTATACA GGTGCTTTCG ATCCGATTAG ACAATTGTTT
ACTGCCACTG TAGAAGCTAA AGCTAGAGGT TATCAAGCTG GCCAATTTAG CTTTAATGTT
AAAGGGGGTA GATGCGAGGC TTGTAGAGGT CAAGGAGTTA ATGTCATTGA AATGAATTTT
TTGCCTGATG TTTATGTTCA ATGTGATGTA TGTAAAGGTG CTCGTTTTAA TAGAGAAACG
CTTCAAGTTA AATATAAAGG TTTTAATATA TCAGATGTCT TAGAAATGAC TGTTGAGCAA
GCAGCTGAAA CTTTCTCTGC TATTCCTCAA GCTGCAGATA GATTGTCTAC TTTAGTTGAT
GTAGGACTTG GTTACGTTAA ATTAGGCCAG CCAGCTCCAA CATTATCTGG TGGAGAGGCA
CAAAGAGTAA AATTAGCAAC TGAATTATCA AAGAGAGCTA CTGGAAAAAC TTTATATTTA
ATTGATGAGC CAACAACGGG ATTAAGTTTT TATGATGTGC ATAAATTAAT GGATGTTATT
CAACGTTTAG TAGATAAAGG TAATTCTGTA GTAGTTATTG AGCATAATTT AGATGTTATT
AGATGTTCAG ATTGGATAAT TGATCTTGGA CCTGATGGAG GTGATAAAGG TGGAGAAATC
ATCGTGGAGG GTACACCTGA AGATGTTGCG AAGCATGCAA CAAGTTATAC AGCCAAATAT
TTAAAGCAAG CTCTAAATTG A
 
Protein sequence
MVKKNIGINE DNSIKIRGAR QHNLKNIDLT LPRNKFIVFT GVSGSGKSSL AFDTIFAEGQ 
RRYVESLSAY ARQFLGQVDK PDVDNIEGLS PAISIDQKST SHNPRSTVGT VTEIQDYLRL
LFGRAGEPHC HHCGLPIAPQ TIDEMVDQIV LLPEGTRYQL LAPVVRGKKG THAKLLSGLA
AEGFARVRIN GEVRELADSI ELDKNHIHNI EVVVDRLIAR DGIQERLNDS LQTCLKRGDG
LSIVEVVPKK GETLPPELDK EKLYSENYAC PIHGSIVEEL SPRLFSFNSP YGACSDCHGI
GYLKKFTADR VIPDTSLPVY AAIAPWSEKD NTYYFSLLYS VGQAYGFELK TPWKDLSDLQ
KNVLLSGSDK PILIQADSRF KTSSGFERPF EGILPILERQ LSESNGESVK QKLEKYLELV
PCKTCSGKRL KPEALAVKLG PYNITDLTSI SVSETLSHIE KIMGLSKNKT EIVLLSSKQK
QIGELVLKEI RLRLKFLINV GLDYLTLDRP AMTLSGGEAQ RIRLATQIGA GLTGVLYVLD
EPSIGLHQRD NDRLLETLKN LRDLGNTLVV VEHDEDTMKS ADYLVDIGPG AGVYGGEIIA
KGTFEDVLNS EKSLTGAYLS GRKSIPTPSV RRSSVKKSLI LNNCVKNNLK DISVEFPLGR
LVSVTGVSGS GKSTLVNELL HPALSHSLGL KVPFPKGVKE LKGIKAIDKV IVIDQSPIGR
TPRSNPATYT GAFDPIRQLF TATVEAKARG YQAGQFSFNV KGGRCEACRG QGVNVIEMNF
LPDVYVQCDV CKGARFNRET LQVKYKGFNI SDVLEMTVEQ AAETFSAIPQ AADRLSTLVD
VGLGYVKLGQ PAPTLSGGEA QRVKLATELS KRATGKTLYL IDEPTTGLSF YDVHKLMDVI
QRLVDKGNSV VVIEHNLDVI RCSDWIIDLG PDGGDKGGEI IVEGTPEDVA KHATSYTAKY
LKQALN