Gene A9601_19211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_19211 
SymboluvrA 
ID4718661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1661473 
End bp1664376 
Gene Length2904 bp 
Protein Length967 aa 
Translation table11 
GC content35% 
IMG OID640079656 
Productexcinuclease ABC subunit A 
Protein accessionYP_001010310 
Protein GI123969453 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAATA AGGTTGATAG TAGTTTTGGA GAAGATAACT CAATCAATAT TAGAGGAGCT 
CGACAGCATA ATTTAAAAAA TATTGATCTT TCTCTACCTA GGAACAAATT TATAGTTTTT
ACAGGTGTGA GTGGAAGTGG TAAAAGTTCT TTAGCCTTTG ATACTATTTT TGCTGAAGGT
CAAAGAAGAT ATGTTGAAAG TCTTTCGGCA TACGCAAGGC AGTTTTTGGG TCAAGTAGAT
AAACCAGATG TTGACAATAT TGAAGGTTTA TCACCTGCTA TTTCAATTGA TCAAAAATCT
ACAAGTCATA ATCCTCGATC AACAGTTGGA ACAGTAACAG AGATACAAGA TTATTTAAGA
TTATTGTTTG GTCGTGCTGG TGAGCCTCAT TGTCACCACT GCGGGATTCC AATAGCGCCG
CAAACAATTG ATGAAATGGT TGATCAAATT CTTCTCTTGC CAGAAGGAAC AAGGTACCAA
TTGTTGGCTC CTGTTGTAAG AGGAAAAAAA GGAACACATA CAAAATTAAT AAGTGGACTA
GCTGCTGAAG GATTTGCTAG GGTAAGAATC AACGGAGAGG TAAGAGAACT TGCTGATAGT
ATTGAATTAG ATAAAAATCA AATTCATAAT ATTGAGGTAG TAGTTGATAG ATTAATTGCA
AGAGATGGAA TACAAGAAAG ATTAAATGAT TCTCTACAAA CTTGTCTTAA AAGAGGTGAT
GGCCTAGCAA TAGTAGAAGT TGTTCCAAAA AAAGGAGAAA ACTTACCTTC TAACTTGGAG
AGAGAAAAAC TTTACTCAGA AAATTATGCA TGTCCTGTGC ATGGCTCTAT TGTTGAAGAA
CTTTCTCCTA GATTATTTTC TTTTAATAGC CCATATGGGG CATGTCCAGA TTGTCATGGG
ATTGGTTATT TAAAAAAATT TACTGCGGAT AGAGTTATAC CTGATAAAAC ATTGCCTGTT
TATGCTGCAA TAGCTCCTTG GAGTGAAAAA GATAATACTT ATTACTTCTC TTTACTTTAT
TCCGTAGGAC AAGCCTATGG TTTTGAATTA AAAACTCCTT GGAAAGATTT AAGTGATTTG
CAAAAACAAG TTCTGCTTTT GGGATCAGAT AAACCAATTT TAATTCAAGC TGATAGTCGT
TTTAAAACTT CTAGTGGTTT TGAAAGACCT TTTGAGGGGA TTTTACCAAT ATTAGAAAGG
CAATTGAATG AAGCCAATGG AGAATCAGTT AAACAAAAAT TAGAAAAGTA TCTAGAATTA
GTTCCCTGTA AGACATGTTC TGGAAAAAGA TTAAGACCTG AGGCTTTGGC TGTTAAACTT
GGCCCATACA ACATTACTGA TTTAACTTCT ATAAGCGTTT CTGAAACCTT AAATCACGTA
GAGCGCATCA TGGGTTTAGG TAAGACAAAG AAAGAAAATA TATCTTTATC AGAAAAACAA
AAGCAGATTG GTGAATTGGT ATTAAAAGAG ATTCGTTTAC GTTTGAAGTT TTTAATTAAT
GTAGGTTTAG ATTATTTGAC TTTAGACAGA CCTGCTATGA CTTTGTCTGG TGGTGAGGCT
CAGCGTATTA GATTGGCCAC ACAAATAGGT GCAGGTCTTA CTGGCGTTTT ATATGTATTA
GATGAACCAA GTATTGGCTT GCATCAGAGA GACAATGACA GATTATTAGA AACATTAAAA
AGCTTAAGAG ACTTGGGAAA TACTTTGGTC GTCGTTGAAC ATGACGAAGA TACTATGAAA
TCCGCAGATT ATTTAGTAGA TATTGGTCCA GGGGCAGGTG TTTATGGTGG GGAAATTATT
GCTAAAGGAT CTTATCAAGA TGTCTTAAAT TCCGAAAAGT CATTAACTGG AGCTTATCTC
AGTGGTAGGA AGTCGATTCC TACTCCAAAA GAACGTAGAT CATCTGTAAA AAAAAGTTTA
ATTTTAAATA ATTGTATTAA AAATAATTTA AAAAATATTT CTGTTGAATT TCCTTTAGGA
AGATTAGTTT CTGTAACTGG TGTGAGTGGA AGTGGGAAGA GCACTTTGAT AAATGAATTA
CTTCATCCTG CATTATGTCA TTCTCTAGGA TTAAAAGTCC CTTTCCCGCA AGGAGTAAAA
GAGTTAAAGG GTATAAAGGC AATTGATAAA GTTATCGTAA TTGATCAATC TCCTATAGGA
AGAACTCCAA GATCAAATCC TGCTACATAT ACCGGTGCTT TTGATCCTAT ACGGCAGATA
TTTACTGCCA CAGTAGAAGC AAAAGCAAGA GGTTATCAAG CTGGTCAATT CAGCTTTAAC
GTGAAGGGTG GAAGATGCGA AGCTTGTAAA GGTCAGGGAG TCAATGTAAT TGAAATGAAT
TTTTTACCTG ATGTCTATGT TCAATGTGAA GTATGTAAAG GAGCTCGTTT TAATAGGGAA
ACTCTTCAGG TGAAATATAA AGGTTTCAAT ATATCTGATG TTTTAGAGAT GACTGTTGAA
CAAGCTGCAG AAACTTTCTC AGCAATACCT CAAGCTGCGG ACAGATTATC TACATTGGTT
GATGTTGGAT TAGGATATGT CAAATTAGGC CAGCCAGCTC CTACATTATC TGGAGGAGAG
GCTCAAAGAG TTAAGTTAGC CACGGAATTG TCAAAAAGGG CTACTGGAAA AACTTTATAT
TTGATTGATG AACCAACGAC AGGGTTAAGT TTTTATGATG TTCATAAATT AATGGATGTG
ATACAACGTT TGGTAGATAA AGGTAATTCA GTAATTGTTA TTGAACATAA TTTAGATGTT
ATTAGATGTT CAGATTGGAT TATCGATTTA GGACCTGATG GAGGGGATAA AGGAGGAGAA
ATTATTGCAG AAGGTATTCC TGAGGATGTA GCTAAAAATC CTACAAGTCA TACAGCAAAA
TATCTTAAAA AGGTTCTGAA ATAA
 
Protein sequence
MVNKVDSSFG EDNSINIRGA RQHNLKNIDL SLPRNKFIVF TGVSGSGKSS LAFDTIFAEG 
QRRYVESLSA YARQFLGQVD KPDVDNIEGL SPAISIDQKS TSHNPRSTVG TVTEIQDYLR
LLFGRAGEPH CHHCGIPIAP QTIDEMVDQI LLLPEGTRYQ LLAPVVRGKK GTHTKLISGL
AAEGFARVRI NGEVRELADS IELDKNQIHN IEVVVDRLIA RDGIQERLND SLQTCLKRGD
GLAIVEVVPK KGENLPSNLE REKLYSENYA CPVHGSIVEE LSPRLFSFNS PYGACPDCHG
IGYLKKFTAD RVIPDKTLPV YAAIAPWSEK DNTYYFSLLY SVGQAYGFEL KTPWKDLSDL
QKQVLLLGSD KPILIQADSR FKTSSGFERP FEGILPILER QLNEANGESV KQKLEKYLEL
VPCKTCSGKR LRPEALAVKL GPYNITDLTS ISVSETLNHV ERIMGLGKTK KENISLSEKQ
KQIGELVLKE IRLRLKFLIN VGLDYLTLDR PAMTLSGGEA QRIRLATQIG AGLTGVLYVL
DEPSIGLHQR DNDRLLETLK SLRDLGNTLV VVEHDEDTMK SADYLVDIGP GAGVYGGEII
AKGSYQDVLN SEKSLTGAYL SGRKSIPTPK ERRSSVKKSL ILNNCIKNNL KNISVEFPLG
RLVSVTGVSG SGKSTLINEL LHPALCHSLG LKVPFPQGVK ELKGIKAIDK VIVIDQSPIG
RTPRSNPATY TGAFDPIRQI FTATVEAKAR GYQAGQFSFN VKGGRCEACK GQGVNVIEMN
FLPDVYVQCE VCKGARFNRE TLQVKYKGFN ISDVLEMTVE QAAETFSAIP QAADRLSTLV
DVGLGYVKLG QPAPTLSGGE AQRVKLATEL SKRATGKTLY LIDEPTTGLS FYDVHKLMDV
IQRLVDKGNS VIVIEHNLDV IRCSDWIIDL GPDGGDKGGE IIAEGIPEDV AKNPTSHTAK
YLKKVLK