Gene NATL1_21961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21961 
SymboluvrA 
ID4779270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1856248 
End bp1859190 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content36% 
IMG OID640085494 
Productexcinuclease ABC subunit A 
Protein accessionYP_001016016 
Protein GI124026901 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGTC CGAAACCTAA TTCAAAAAAA AATGCATTAA ATAATGGTTC TTCTACAGAA 
GATCTCATAA CAATTCGTGG TGCAAGGCAA CATAATTTAA AAAACGTTGA TTTATCTATT
CCAAGAAATC AATTGGTTGT TTTTACAGGT GTTAGTGGAA GTGGTAAAAG TTCTTTAGCT
TTTGATACTA TTTTTGCTGA AGGGCAAAGA CGTTATGTGG AGAGTTTGTC CGCTTATGCA
AGGCAATTCT TAGGTCAGGT TGATAAACCA GACGTTGATT CAATTGAAGG TTTATCACCA
GCAATTTCAA TTGATCAAAA ATCAACAAGT CATAATCCGC GTTCAACAGT TGGAACAGTT
ACGGAGATAC AAGATTATTT TCGTTTGCTT TTTGGAAGAG CCGGGGAACC TCATTGCCCT
GAGTGCTCTA GGCCAATCAA GCCCCAAAGT ATAGATGAGA TGGTTGATCA AATTAAGACT
CTTCCTGAAG GATCTAGATA TCAATTACTT GCACCAGTCG TCAGAGGTAA AAAAGGAACA
CACTCAAAAT TATTATCTGG ACTTGCTGCA GAAGGGTTTG TTCGAGTAAG AATTAATAAA
GAAGTGAGAG AGTTGGCAGA TAATATTGAA TTAGATAAAA ATCAAGTTCA TTCTATTGAA
GTAGTGGTTG ATAGATTGAT TGCGAGAGAG GGTATTGAAG AGAGATTAAC TGATTCATTA
AGCACTACTT TGAAAAGGGG AGATGGTTTA GCAATCGTTG AAGTAGTCCC TAAAAAGAAC
GAAGAACTTC CTAAAGGTAT TGAAAAAGAA AGATTATTTT CTGAGAATTT CGCTTGTCCA
GTTCATGGGG CGGTAATTGA GGAATTATCT CCAAGATTAT TTTCATTTAA TAGCCCATAT
GGAGCTTGTC CTGATTGTCA TGGTTTAGGT CATTTGAAAA AATTTACTTG TGAGAGGGTT
GTACCTGACC CATCTTTACC GGTTTATGCT GCTGTTGCAC CCTGGAGTGA TAAAGATAAT
TCATATTATT TTTCATTATT ATTTTCAGTT GGCGAGGCAT TTGGTTTTGA AATAAAAACT
CCTTGGAAAG ATTTAAAAGA GGAGCAAAAA AATATTTTGT TAAATGGTAG TAAGGAACCT
ATTTTAATAA AAGTAGATAG TAGATATAAG CAAGATTCGG GATTTAAAAG ACCTTTTGAA
GGTATTTTGC CTATTTTAGA AAGACAATTA CAGGATGCGA ACGGAGAAGC TGTTCGCCAA
AAGTTAGAGA AATACCTTGA ACTAGTTCCA TGTGCAAGCT GTCATGGCAA GAGATTACGT
CCGGAAGCGC TTGCTGTAAA ACTTGGGCCT TTTGCCATAA CTGATCTCAC TTCATCAAGT
GTTTCTACCA CGCTTGAGAA TGTTGAAGAA TTAATGGGTA TAGAAACAAC TAATAATTCA
AAGCAATTAC TTTCTAATAA ACAAAAGAAG ATTGGGGAAT TAGTCCTCAA AGAAATTCGA
TTACGGCTTC AATTTCTGTT AGATGTTGGA CTTGATTATT TAACTTTAGA TAGGCCCGCA
ATGACTTTAT CTGGGGGGGA AGCTCAAAGA ATACGCCTTG CTACGCAAAT AGGTGCTGGT
TTAACAGGAG TTCTTTATGT GTTAGATGAA CCTAGTATTG GACTACATCA AAGAGATAAT
GATCGATTAT TGTCTACATT AAAAAAGTTA CGTGATCTAG GTAATACTTT GGTTGTTGTA
GAACATGATG AAGACACGAT AAGATCAGCT GATCATTTGG TAGACATTGG ACCTGGTGCA
GGTGTGCATG GAGGTGAAAT TATTGCTCAA GGATCTTTGG ATAATTTGTT AACAGCAAAA
AAATCATTGA CGGGTGCTTA TCTTAGTGGG CGCTCATCTA TTCCTACTCC TACTAGCAGG
AGAGATTCAG TACAAAGAAA ATTACGTTTG ATTGATTGTA ATAAAAATAA TTTAAAAAAT
GTTTCAGTTG ATTTTCCATT AGGCAGATTA GTTGCTGTCA CTGGAGTAAG TGGAAGTGGT
AAAAGTACTC TGATAAATGA ATTACTTCAT CCTGCTTTAA ATCACTCTTT AGGCTTGAAA
GTTCCTTTTC CAAAAGGATT AAAAGAACTG AAAGGTATTA AATCTATTGA TAAAGTTATT
GTCATTGACC AATCTCCAAT TGGTAGAACC CCAAGATCTA ATACTGCTAC TTACACAGGT
GCATTTGATC CTATACGCCA ACTTTTTGCA ACCTCTGTTG AAGCAAAAGC GAGAGGATAT
CAAGCAGGTC AATTCAGTTT TAACGTTAAA GGTGGAAGAT GTGAGGCTTG TCGTGGTCAA
GGCGTAAATG TAATTGAAAT GAATTTTCTT CCTGATGTTT ATGTTCAATG TGATGTATGT
AAAGGAGCTC GATTCAATCG AGAGACATTA CAAGTCAAAT ATAAAAACTA TTCTATTTCC
GATGTATTAG AAATGACTGT TGAGCAAGCG GTTGATGTTT TTTCTGCAAT ACCTCAAGCT
GCAGATCGAT TAAGGACGCT GTTAGATGTT GGGCTTGGAT ACATTAAACT TGGTCAACCT
GCTCCAACAC TATCTGGAGG AGAAGCTCAA AGAGTAAAAC TTGCCACTGA GTTATCTAGA
AGAGCTACTG GTAAAACTCT TTATTTAATC GATGAACCTA CAACTGGTTT AAGTTTTTAT
GATGTTCATA AATTAATGGA TGTTATTCAG AGATTGGTAG ATAAAGGGAA CTCAATTATT
GTTATTGAGC ATAATTTGGA TGTTATTAGG TGTTCAGATT GGGTTATTGA TATGGGACCT
GAAGGAGGTA ATCGAGGAGG CGAGATTATT GCTATGGGAA CCCCTGAAGA AGTGGCAACA
AATAAAAACA GTCATACAGG TGGTTATTTA AAAAAAGTTT TAGAAATGCA CCCTTCTAGA
TAG
 
Protein sequence
MGSPKPNSKK NALNNGSSTE DLITIRGARQ HNLKNVDLSI PRNQLVVFTG VSGSGKSSLA 
FDTIFAEGQR RYVESLSAYA RQFLGQVDKP DVDSIEGLSP AISIDQKSTS HNPRSTVGTV
TEIQDYFRLL FGRAGEPHCP ECSRPIKPQS IDEMVDQIKT LPEGSRYQLL APVVRGKKGT
HSKLLSGLAA EGFVRVRINK EVRELADNIE LDKNQVHSIE VVVDRLIARE GIEERLTDSL
STTLKRGDGL AIVEVVPKKN EELPKGIEKE RLFSENFACP VHGAVIEELS PRLFSFNSPY
GACPDCHGLG HLKKFTCERV VPDPSLPVYA AVAPWSDKDN SYYFSLLFSV GEAFGFEIKT
PWKDLKEEQK NILLNGSKEP ILIKVDSRYK QDSGFKRPFE GILPILERQL QDANGEAVRQ
KLEKYLELVP CASCHGKRLR PEALAVKLGP FAITDLTSSS VSTTLENVEE LMGIETTNNS
KQLLSNKQKK IGELVLKEIR LRLQFLLDVG LDYLTLDRPA MTLSGGEAQR IRLATQIGAG
LTGVLYVLDE PSIGLHQRDN DRLLSTLKKL RDLGNTLVVV EHDEDTIRSA DHLVDIGPGA
GVHGGEIIAQ GSLDNLLTAK KSLTGAYLSG RSSIPTPTSR RDSVQRKLRL IDCNKNNLKN
VSVDFPLGRL VAVTGVSGSG KSTLINELLH PALNHSLGLK VPFPKGLKEL KGIKSIDKVI
VIDQSPIGRT PRSNTATYTG AFDPIRQLFA TSVEAKARGY QAGQFSFNVK GGRCEACRGQ
GVNVIEMNFL PDVYVQCDVC KGARFNRETL QVKYKNYSIS DVLEMTVEQA VDVFSAIPQA
ADRLRTLLDV GLGYIKLGQP APTLSGGEAQ RVKLATELSR RATGKTLYLI DEPTTGLSFY
DVHKLMDVIQ RLVDKGNSII VIEHNLDVIR CSDWVIDMGP EGGNRGGEII AMGTPEEVAT
NKNSHTGGYL KKVLEMHPSR