Gene P9211_18501 details

Gene Information

Locus tag: P9211_18501
Symbol: uvrA
ID: 5730184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1680489
End bp: 1683428
Gene Length: 2940 bp
Protein Length: 979 aa
Translation table: 11
GC content: 39%
IMG OID: 641286237
Product: excinuclease ABC subunit A
Protein accession: YP_001551735
Protein GI: 159904391
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
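
The start, end, gene length, and protein length fields above are mutually consistent. The sketch below shows one way to check this, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function name `feature_lengths` is hypothetical and is not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated on the page): coordinates are 1-based and
    inclusive, and the final codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


print(feature_lengths(1680489, 1683428))  # (2940, 979)
```

The result matches the reported values of 2940 bp and 979 aa.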


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.95453
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0541711
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCCTT CATTGGATAA AAATAAAAGA AAATCTTTAA CCATTGAATC GCATGAAGAT 
GTAATACGAA TTCGTGGTGC GCGTCAACAT AATCTAAAGA ACATTGATTT AACGATTCCA
AGGAATAGTT TTATTGTTTT TACAGGTGTA AGCGGTAGTG GTAAAAGCTC CTTGGCCTTT
GACACTATTT TTGCTGAGGG CCAACGCCGT TATGTTGAAA GTCTTTCTGC TTATGCACGT
CAATTTTTAG GTCAAGTTGA TAAACCTGAT GTAGATGCCA TAGAAGGCTT ATCTCCAGCA
ATTTCAATCG ATCAAAAATC AACCAGTCAT AACCCACGTT CAACTGTTGG GACAGTTACT
GAAATACAGG ATTATTTACG TTTGTTGTTT GGTAGAGCAG GAGAGCCTCA TTGCCCTGAA
TGTCATAGAC CTATTAAGCC TCAAACCATA GATGAAATGG TTGATCAAAT ACTTTCTTTG
CCTGAAGGCA CTCGATATCA ACTACTTTCT CCCGTTGTAA GAGGCAAAAA AGGCACTCAT
GCCAAATTAT TGTCGGGATT AGCTTCAGAA GGCTTTGCTC GGGTACGAAT TAATAAAGAG
GTACGAGAAT TGTCAGATAA TATTGAACTT GATAAAAATC ATTCTCATTC AATAGAAGTT
GTTGTAGATC GTTTAATTGC CAGAGAAGGT ATTCAAGAAC GTTTAACAGA CTCTTTACGA
ACGGCATTGA AGCGGGGAGA AGGTTTGGCT TTAATTGAGG TGGTGCCTAA AAAGAATGAG
GAGCTACCCA AAAATTTAGA ACGAGAAAGA CTTTTTTCGG AAAATTTTGC GTGCCCTATT
CATGGAGCTG TTATTGAAGA ACTCTCTCCT CGATTATTTT CTTTTAATAG TCCTTATGGC
GCATGCCCAG AATGTCATGG GATAGGGCAT TTGAAAAAGT TTACTCTTGA TCGAGTAGTA
CCTGACCCTT CTCTGCCTGT ATATGCTGCA GTTGCTCCGT GGAGTGATAA GGATAACTCT
TATTATTTTT CTTTGCTGTA TTCAGTTGGT GAAGCATTTG GTTTTGAAAT TAAAACGCCA
TGGAAAGAAT TAACCCAAGA GCAGCAAGAT GTTTTGCTTA ATGGTTCTGA AAAACCAATT
TTGATACAAG CTGATAGTAG ATATAAGCAA AAAGGGGGGT TTAAACGTCC TTTTGAAGGC
ATTCTTCCTA TTTTGGAAAG GCAATTAAGT GATGCAAATG GAGAATCAGT TCGACAGAAA
TTAGAAAAAT ATTTAGAGTT AGTACCTTGT TCTACTTGCT CGGGTAAAAG ACTTCGACCA
GAGGCTTTAG CTGTCAAGAT AGGACCATAT GCAATTAATG AATTTACAGA AACGAGCGTT
TCGCAAACTC TTGAACGTAT TGAACAATTA ATGGGTGTAG GAGAGTCAAC TGCAGCTTCT
ACTCCTTTGT TAACTCCTCG GCAAATTCAA ATTGCAGATT TGGTTTTAAG AGAAATTCGC
TTACGATTGA AGTTTCTTCT TGATGTTGGC CTTGATTATC TTTCTTTAGA TAGACCTGCG
ATGACTTTAT CTGGAGGAGA GGCTCAAAGG ATTCGCTTAG CAACTCAGAT TGGAGCTGGT
TTGACAGGTG TTCTTTATGT ATTGGATGAG CCGAGTATTG GATTGCATCA AAGAGATAAT
GATCGGCTAT TGACTACTCT TCAGAGATTA CGTGATTTGG GTAATACACT TATTGTTGTG
GAGCATGATG AAGACACTAT TCGTGCTGCT GATTATTTGG TTGATATTGG TCCTGGAGCT
GGTGTTCATG GAGGTGAGAT AATAGCTGAG GGATCAATAG ATAACTTATT AGCTGCAAGT
AAATCTTTGA CTGGTGCATA TTTAAGTGGC CGTTCGGCAA TACCGACACC TAAAGAACGT
CGAAAAAGTG GTAAGAAACA TCTTCGTTTA ATTAATTGTG ACCGTAATAA TTTACAGAAT
ATATCAGTAG ATTTTCCTTT AGGTCGTTTG GTAGCTGTTA CAGGAGTGAG TGGAAGTGGT
AAAAGCACTT TGGTTAATGA ATTGTTGCAT CCTGCTATTA ATCATGAATT AGGTTTGAAA
GTTCCTTTTC CGAAAGGTAT GAAGGAGATC CGTGGAATTA ATGCTATTGA TAAAGTAATT
GTGATTGATC AATCACCTAT TGGTAGAACA CCGCGATCTA ATCCTGCTAC CTATACAGGT
GCTTTTGATC CAATTAGGCA AATTTTTGCA GCTGCCGTGG AGGCTAAAGC TAGAGGATAT
CAAGTTGGTC AATTTAGCTT TAATGTTAAA GGGGGTCGAT GTGAAGCTTG CCGAGGTCAA
GGTGTAAATG TAATTGAAAT GAATTTTTTG CCAGATGTTT ATGTTCAGTG TGATGTTTGT
AAGGGTGCTC GATTTAATAG AGAAACTTTG CAAGTCAAGT ATAAAGGCTA CACAATTGCC
GATGTATTAG AGATGACTGT AGAGCAATCA GTAGATGTAT TCTCTGCAAT TCCTCAAGCT
GCTGATAGGT TAAGAACCTT AGTCGATGTT GGCCTTGGTT ATATAAAGTT AGGCCAACCG
GCACCAACAC TTTCAGGAGG TGAAGCACAG CGAGTCAAAC TTGCCACAGA ATTATCTCGT
AGGGCAACAG GCAAAACTGT GTATTTGATA GATGAGCCAA CTACAGGACT TAGTTTTTAT
GACGTTCATA AATTAATGGA TGTGATTCAA CGGCTAGTAG ATAAAGGGAA TTCCATTATT
GTTATTGAAC ATAATCTTGA TGTTATTAGA TGCTCGGATT GGATTATTGA TTTGGGCCCT
GATGGTGGTG ATCGTGGAGG CAATGTCATT GCTACTGGGA CTCCCGAAGA AGTGGCTGAA
CACTCAGATA GTTATACCGG CAGCTATTTG AAGAATGTTT TAGCAAAACA TCCCCCTTAG
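
The GC content field can in principle be recomputed from the gene sequence. The page does not define the field, so the sketch below assumes it is the percentage of G and C bases over the whole gene sequence. The function name `gc_percent` is hypothetical. Whitespace between the 10-base blocks is ignored.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block separators and line breaks) is removed
    before counting. This definition of GC content is an assumption.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence: 7 of 20 bases are G or C.
print(gc_percent("ATGAGTCCTT CATTGGATAA"))  # 35.0
```

Passing the full gene sequence to the same function should give a value that rounds to the reported percentage if the assumed definition is the one the page uses.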
 
Protein sequence
MSPSLDKNKR KSLTIESHED VIRIRGARQH NLKNIDLTIP RNSFIVFTGV SGSGKSSLAF 
DTIFAEGQRR YVESLSAYAR QFLGQVDKPD VDAIEGLSPA ISIDQKSTSH NPRSTVGTVT
EIQDYLRLLF GRAGEPHCPE CHRPIKPQTI DEMVDQILSL PEGTRYQLLS PVVRGKKGTH
AKLLSGLASE GFARVRINKE VRELSDNIEL DKNHSHSIEV VVDRLIAREG IQERLTDSLR
TALKRGEGLA LIEVVPKKNE ELPKNLERER LFSENFACPI HGAVIEELSP RLFSFNSPYG
ACPECHGIGH LKKFTLDRVV PDPSLPVYAA VAPWSDKDNS YYFSLLYSVG EAFGFEIKTP
WKELTQEQQD VLLNGSEKPI LIQADSRYKQ KGGFKRPFEG ILPILERQLS DANGESVRQK
LEKYLELVPC STCSGKRLRP EALAVKIGPY AINEFTETSV SQTLERIEQL MGVGESTAAS
TPLLTPRQIQ IADLVLREIR LRLKFLLDVG LDYLSLDRPA MTLSGGEAQR IRLATQIGAG
LTGVLYVLDE PSIGLHQRDN DRLLTTLQRL RDLGNTLIVV EHDEDTIRAA DYLVDIGPGA
GVHGGEIIAE GSIDNLLAAS KSLTGAYLSG RSAIPTPKER RKSGKKHLRL INCDRNNLQN
ISVDFPLGRL VAVTGVSGSG KSTLVNELLH PAINHELGLK VPFPKGMKEI RGINAIDKVI
VIDQSPIGRT PRSNPATYTG AFDPIRQIFA AAVEAKARGY QVGQFSFNVK GGRCEACRGQ
GVNVIEMNFL PDVYVQCDVC KGARFNRETL QVKYKGYTIA DVLEMTVEQS VDVFSAIPQA
ADRLRTLVDV GLGYIKLGQP APTLSGGEAQ RVKLATELSR RATGKTVYLI DEPTTGLSFY
DVHKLMDVIQ RLVDKGNSII VIEHNLDVIR CSDWIIDLGP DGGDRGGNVI ATGTPEEVAE
HSDSYTGSYL KNVLAKHPP
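
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, written without external libraries. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its set of permitted start codons. Because this gene starts with ATG, no special start-codon handling is included, which is a simplification. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first. Ambiguous bases such as N are not
    handled and raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAGTCCTT CATTGGATAA AAATAAAAGA"))  # MSPSLDKNKR
```

The output matches the first ten residues of the protein sequence listed above.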