Gene NATL1_21171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21171 
SymboluvrB 
ID4780244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1772477 
End bp1774513 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content35% 
IMG OID640085414 
Productexcinuclease ABC subunit B 
Protein accessionYP_001015937 
Protein GI124026822 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGT ATAAACTTCA AGCTCCTTAT ACTCCGAAAG GGGATCAACC TGCAGCGATT 
AAAGGGCTTG TTGGAGGTGT AAATGATGGG GAAAAATTTC AAACTTTATT AGGTGCAACT
GGAACAGGGA AGACTTTTAC AATAGCCAAT TTGATCGCTC AAACTGGAAG GCCTGCGTTA
GTTTTAGCTC ATAACAAAAC ACTTGCAGCT CAATTATGCA ATGAACTAAG GGAATTTTTC
CCAGATAATG CGGTTGAATA TTTTATCTCA TATTATGATT ATTATCAGCC AGAGGCTTAT
GTTCCAGTAA GTGATACTTA CATAGCCAAG ACTTCATCAA TTAATGAAGA GATTGATATG
TTACGCCACT CTGCGACTAG GTCTTTATTT GAACGAGATG ATGTCATAGT TGTTGCCTCA
ATTAGTTGTA TATATGGTTT AGGAATTCCA AGTGAATATC TAAAGGCTTC TGTAAAGTTT
CAAGTTGGTC AAAGTATTGA TTTAAGATCT TGCCTTAGAT CATTAGTTTC CAATCAATAT
ACTCGTAACG ATATCGAAAT CAGTAGAGGT AGATTTAGAG TTAGAGGAGA TGTATTAGAA
ATAGGGCCAG CATATGATGA CAGACTTGTA AGGCTTGAAT TATTTGGAGA TGAAGTAGAA
AGTATTAGCT ATGTCGATCC TACAACTGGA GAAATCTTAA ATAAACTTGA CTCAATCAAC
ATATATCCAG CAAAACATTT TGTTACTCCA AAAGATCGCC TCGATTCTGC AATTAAAGCA
ATAAAAAAAG AGTTAAAAGA TAGATTAGAA TTTCTTAATC AAGAGGGTAA ATTACTCGAG
GCTCAAAGGT TAGAACAACG AACAATTTAT GACCTAGAAA TGTTAAAAGA AGTTGGATAT
TGTAATGGTG TTGAAAATTA TGCTCGTCAT CTTTCTGGAA GAGAACCAGG CTCTGCCCCA
GAATGTCTTA TTGATTACTT CCCAAAAGAC TGGTTATTAC TTATTGATGA AAGTCATGTT
ACTTGTCCTC AATTAAGAGC TATGTATAAT GGTGATCAAG CAAGAAAGAA AGTCTTAATA
GATCATGGAT TTAGACTACC AAGTGCAGCT GATAATCGTC CTTTAAAGGA TATAGAATTT
TGGAATAAGG CAAAACAAAC TGTTTTTATA AGTGCTACTC CTGGTGATTG GGAATTGTCC
CAAAGTACAA AGAATATAGT TGAGCAAGTT ATCAGACCTA CCGGTGTTTT AGATCCTCTA
GTTGAAGTAC GTCCAACACA TGGTCAAGTT GACGATTTGC TTTTTGAAAT TAGAAAAAGA
GCATCAAAAA ATCAAAGAAT ACTTGTCACA ACGCTCACGA AAAGAATGGC TGAAGATCTT
ACAGATTATT TATCTGAAAA TAAAATAAGA GTTCGCTATT TACATTCAGA GATTCATTCA
ATTGAGAGAA TTGAAATCAT TCAAGATTTA CGATTGGGAG AATATGACGT ATTAGTTGGA
GTGAATCTTT TAAGAGAGGG TTTGGATTTA CCTGAGGTCT CTCTTGTCGT AATTCTTGAT
GCAGATAAAG AAGGATTTTT AAGAGCTCAA AGATCTTTGA TACAAACAAT AGGTCGAGCA
GCAAGACATG TAGAAGGCTT AGCTTTACTC TATGCAGATA AGATGACTGA TTCTATGGCA
AAGGCTATAA GTGAGACTGA AAGACGCAGA GAAATACAGA ATATTTATAA TATTGAGCAT
GGTATAACTC CTAAGCCAGC AGGTAAGAAA GCAAGTAATT CTATTCTTTC TTTTTTAGAG
ATCTCCAGAA GATTAAATCA AGATGGAAGT ACTGATGATT TCGTTGATAT CGCTGATAAG
TTGATTGAAC ATAGTGCTAA AGATTCTGAT AGCGGAGTAT CTTTAGAATC TTTGCCTGAA
TTAATTGAAA AATTGGAATC TAAAATGAAA ATAAAAGCTA AAGACCTAGA TTTTGAAAAA
GCAGCAATCC TTAGAGATCG TATAAAGAAA TTAAGGCATA GATTAGTCGG TAGATAA
 
Protein sequence
MPEYKLQAPY TPKGDQPAAI KGLVGGVNDG EKFQTLLGAT GTGKTFTIAN LIAQTGRPAL 
VLAHNKTLAA QLCNELREFF PDNAVEYFIS YYDYYQPEAY VPVSDTYIAK TSSINEEIDM
LRHSATRSLF ERDDVIVVAS ISCIYGLGIP SEYLKASVKF QVGQSIDLRS CLRSLVSNQY
TRNDIEISRG RFRVRGDVLE IGPAYDDRLV RLELFGDEVE SISYVDPTTG EILNKLDSIN
IYPAKHFVTP KDRLDSAIKA IKKELKDRLE FLNQEGKLLE AQRLEQRTIY DLEMLKEVGY
CNGVENYARH LSGREPGSAP ECLIDYFPKD WLLLIDESHV TCPQLRAMYN GDQARKKVLI
DHGFRLPSAA DNRPLKDIEF WNKAKQTVFI SATPGDWELS QSTKNIVEQV IRPTGVLDPL
VEVRPTHGQV DDLLFEIRKR ASKNQRILVT TLTKRMAEDL TDYLSENKIR VRYLHSEIHS
IERIEIIQDL RLGEYDVLVG VNLLREGLDL PEVSLVVILD ADKEGFLRAQ RSLIQTIGRA
ARHVEGLALL YADKMTDSMA KAISETERRR EIQNIYNIEH GITPKPAGKK ASNSILSFLE
ISRRLNQDGS TDDFVDIADK LIEHSAKDSD SGVSLESLPE LIEKLESKMK IKAKDLDFEK
AAILRDRIKK LRHRLVGR