Gene Information |
Locus tag | P9303_30191 |
Symbol | uvrA |
ID | 4776864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2674059 |
End bp | 2677034 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640088543 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001019014 |
Protein GI | 124024707 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
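
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows this, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is part of the gene but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(2674059, 2677034)
print(length_bp)                  # 2976
print(protein_length(length_bp))  # 991
```

Both results agree with the Gene Length (2976 bp) and Protein Length (991 aa) values in the record.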
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.213667 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCGAG ATGCCGCCAA GGTCAAGGAT CATCGTTTGA GTAAGCAGCT GAATCTGAGT GGTGGCTCAT TGGAGGATGT GATTCGCGTG CGAGGTGCGC GTCAGCACAA CCTCAAGAAC GTTGATATCA CCCTTCCACG CAACAAGCTG GTGGTTTTAA CCGGAGTAAG TGGAAGCGGA AAGAGCTCTT TAGCGTTTGA CACGATTTTT GCTGAGGGTC AGCGTCGCTA CGTCGAGAGT CTTTCCGCTT ATGCCCGTCA GTTTTTGGGT CAGGTTGATA AGCCTGATGT CGATGCTATC GAGGGGTTGT CACCGGCGAT TTCCATTGAT CAGAAGTCGA CGAGTCACAA TCCTCGTTCA ACCGTTGGAA CGGTCACGGA AATTCAGGAT TATTTGCGCT TGCTATTCGG ACGTGCTGGA GAACCGCACT GTCCGCAATG TGATCGGCTA ATCCGTCCTC AGACCATCGA TGAGATGGTG GATCAGATCC TAATTTTGCC TGAAGGAACC CGTTATCAAT TGTTGGCACC GTTGGTTCGT GGAAAGAAGG GTACTCACGC CAAATTGTTG AGTGGCCTTG CGGCAGAGGG ATTTGCGAGG GTTCGGATTA ATGCAGAGGT TAGAGAGCTT GCCGACAACA TTGAATTGGA CAAGAACCAT CTGCACAGTA TTGAGGTGGT GGTTGATCGT CTGGTGGCTC GGGAAGGGAT TCAGGAACGA TTAACTGATT CATTGCGTAC CACTTTGATG CGCGGTGATG GCCTTGCTTT GGTGGAGGTT GTACCTAAAG CGGATCAAGC TCTCCCCGAG GGTGTGGAGC GTGAGCGGCT CTATTCCGAG AATTTTGCAT GTCCAGTGCA TGGGGCGGTG ATTGAGGAAC TTTCTCCCAG GCTATTTTCG TTTAATAGCC CTTATGGTGC TTGTTCTGAT TGTCATGGCA TCGGCCATCT TCGTAAGTTC ACTTTAGAAC GGGTTGTGCC TGATCCCTCC TTGCCTGTTT ATGCCGCTGT GGCGCCTTGG AGTGATAAGG ACAACAGCTA TTACTTTTCA TTGCTGTATT CAGTTGGTGA GGCCTTTGGT TTTGAAATTA AAACGCCATG GAAGAACTTA ACGGCAGATC AGCAACATGT ATTGCTTTAT GGAAGTTCTG AGCCGATTCA GATTCAGGCT GATAGCCGCT ATAAAAAGAG TACAGGTTAT ATGAGACCTT TTGAGGGCAT TTTGCCCATC TTGGAAAGAC AATTGCGTGA TGCAAGTGGC GAGGCTGTTA AACAGAAACT TGAGAAGTTT CTTGAGTTGG TCCCTTGTGG AAGTTGTGGT GGGAAGAGAT TGCGTGCAGA AGCTCTGGCG GTGAAGGTGG GTCCTTATCG GATTACTGAG CTCACTTCTA TCAGCGTGGC GCTGACTCTT GAACGCATCG AAAAGCTGAT GGGCGTTGGA GCAGCCAATG GATCTGAGCC TTTGCTCAAT TCACGTCAGA TCCAGATCGG TGATTTGGTT TTGCGGGAGA TTCGTATGCG TCTTCGCTTT CTTCTCGATG TTGGGCTCGA ATATCTCAGC TTGGATCGAC CCGCGATGAC CTTGTCTGGC GGTGAGGCTC AGCGGATTCG TTTGGCTACA CAGATCGGCG CTGGTCTTAC GGGTGTGCTT TATGTGCTTG ATGAACCAAG CATTGGTTTG CATCAGCGGG ACAATGATCG TCTGTTAGCC ACGCTCAGAC GTTTAAGGGA TCTGGGTAAC ACTTTAATAG TGGTAGAACA CGATGAGGAC ACCATTCGCG CCGCTGACCA TCTTGTCGAT 
ATTGGTCCAG GGGCAGGGGT GCATGGGGGC CATATTGTTG TTGAGGGATC TTTGGATCAT CTGTTGATGG CTGAAGAATC TTTGACAGGT GCATATCTCA GTGGCCGCCG CTCTATTCCT ACACCTAGAG AGAGGAGAGA AGGAAGCAGT CGCAGGCTTC GTTTGATTGA TTGTGATCGC AATAATCTTA AAAATCTCAC CGTAGATTTT CCTTTAGGGC GCCTCGTTGC GGTCACAGGA GTCAGTGGTA GTGGTAAGAG CACTTTGGTG AATGAGTTGC TTCATCCTGC CTTAGATCAC AGTTTGGGTT TGAAGGTGCC TTTCCCTCAA GGCTTGGGCG AGTTGCGAGG TGTCAATGCG ATCGACAAGG TGATCGTGAT TGACCAGTCT CCAATCGGCC GTACACCGCG TTCTAATCCA GCCACCTACA CCGGTGCGTT TGATCCGATT CGTCAGGTAT TTGCTGCTTC CGTTGAGGCA AAGGCTCGTG GTTATCAGGT GGGTCAGTTT AGTTTCAATG TGAAAGGTGG TCGCTGTGAG GCTTGTCGTG GTCAAGGGGT GAATGTGATT GAAATGAACT TTTTGCCAGA TGTTTATGTT CAGTGCGATG TCTGCAAAGG TGCTCGATTT AATCGGGAGA CCTTACAGGT CACTTACAAG GGACACACAA TTGCAGATGT TTTGCAGATG ACGGTTGAGC AGGCTGCTGA GGTTTTTTCT GCTATTCCCC AGGCGGCTGA TCGATTGCGC ACGTTGGTTG ATGTGGGACT TGGATATGTG AAGTTAGGTC AGCCTGCACC CACGCTTTCA GGTGGTGAAG CTCAACGTGT GAAGCTTGCT ACCGAGCTCT CCAAGCGGGC TACTGGCAAA ACCCTTTATT TGATTGATGA ACCAACCACT GGCCTCAGTT TTTACGACGT GCATAAGTTG ATGGATGTCA TGCAGCGTCT TGTTGATAAA GGCAATTCAA TTATCGTGAT AGAACACAAC TTGGATGTAA TTCGTTGTTC TGATTGGATT ATTGATCTTG GCCCTGAAGG AGGAGATTGT GGTGGCGATC TTCTGGTCAC GGGGACTCCA GAAGAGGTTG CATCCCATCC CACTAGCCAT ACGGGGCATT ATCTCAAGAA GGTGTTAGCG AAGCATCCTC CTGAGACTGT GTCTGTTGCA GCTTGA
|
Protein sequence | MGRDAAKVKD HRLSKQLNLS GGSLEDVIRV RGARQHNLKN VDITLPRNKL VVLTGVSGSG KSSLAFDTIF AEGQRRYVES LSAYARQFLG QVDKPDVDAI EGLSPAISID QKSTSHNPRS TVGTVTEIQD YLRLLFGRAG EPHCPQCDRL IRPQTIDEMV DQILILPEGT RYQLLAPLVR GKKGTHAKLL SGLAAEGFAR VRINAEVREL ADNIELDKNH LHSIEVVVDR LVAREGIQER LTDSLRTTLM RGDGLALVEV VPKADQALPE GVERERLYSE NFACPVHGAV IEELSPRLFS FNSPYGACSD CHGIGHLRKF TLERVVPDPS LPVYAAVAPW SDKDNSYYFS LLYSVGEAFG FEIKTPWKNL TADQQHVLLY GSSEPIQIQA DSRYKKSTGY MRPFEGILPI LERQLRDASG EAVKQKLEKF LELVPCGSCG GKRLRAEALA VKVGPYRITE LTSISVALTL ERIEKLMGVG AANGSEPLLN SRQIQIGDLV LREIRMRLRF LLDVGLEYLS LDRPAMTLSG GEAQRIRLAT QIGAGLTGVL YVLDEPSIGL HQRDNDRLLA TLRRLRDLGN TLIVVEHDED TIRAADHLVD IGPGAGVHGG HIVVEGSLDH LLMAEESLTG AYLSGRRSIP TPRERREGSS RRLRLIDCDR NNLKNLTVDF PLGRLVAVTG VSGSGKSTLV NELLHPALDH SLGLKVPFPQ GLGELRGVNA IDKVIVIDQS PIGRTPRSNP ATYTGAFDPI RQVFAASVEA KARGYQVGQF SFNVKGGRCE ACRGQGVNVI EMNFLPDVYV QCDVCKGARF NRETLQVTYK GHTIADVLQM TVEQAAEVFS AIPQAADRLR TLVDVGLGYV KLGQPAPTLS GGEAQRVKLA TELSKRATGK TLYLIDEPTT GLSFYDVHKL MDVMQRLVDK GNSIIVIEHN LDVIRCSDWI IDLGPEGGDC GGDLLVTGTP EEVASHPTSH TGHYLKKVLA KHPPETVSVA A
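
The sequences above are displayed in space-separated blocks of 10 characters. Before any downstream use, those separators need to be stripped. The sketch below shows one way to do that and to compute a GC fraction, which could be compared with the GC content field once the full gene sequence is supplied. The names `clean_sequence` and `gc_fraction` are hypothetical, and the demonstration uses only the first two blocks of the gene sequence, so its result is not the 49% reported for the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
prefix = clean_sequence("ATGGGGCGAG ATGCCGCCAA")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.65
```

Applying `clean_sequence` to the complete gene sequence should yield a string of 2976 bases if the record is internally consistent.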