Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4529 |
Symbol | uvrA |
ID | 4246183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6986412 |
End bp | 6989426 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638109406 |
Product | excinuclease ABC subunit A |
Protein accession | YP_723982 |
Protein GI | 113477921 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.388668 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAATA TCCAAAACGG ACATCATTCT TACCCCAGCA ATGAAAATAC CATCCGCATC CGAGGAGCCA GACAACATAA CCTCAAAAAC ATCAACCTAG ACCTACCACG CGATCGCCTC ATAGTCTTCA CAGGAGTATC CGGTTCCGGC AAATCATCCC TCGCCTTCGA CACAATTTTT GCCGAAGGGC AACGTCGCTA CGTCGAATCC CTCAGTGCCT ATGCCCGACA ATTTCTCGGA CAGCTCGACA AACCAGATGT GGACTTCATC GAAGGATTAA GCCCTGCCAT TTCCATCGAC CAAAAATCCA CATCCCATAA CCCTCGCTCC ACAGTTGGAA CCGTCACCGA AATTTACGAC TATCTAAGAC TACTATTTGG TAGAGCCGGA GAACCCCATT GCCCTATCTG CCACCATAAT ATTGCCCCCC AAACTATTGA TGAAATGTGC GACCGAGTCA TGGCTTTGCC AGACCGCACT AAATTTTATA TTTTAGCCCC AGTGGTTCGA GGCAAAAAAG GGACTCATAA GAAACTTTTG TCCAGTTTAG CTGCTCAAGG ATTTGTTCGT TTGAGAGTTG ATGGAGAGGT TGTAGAAATT GCTGAAAACA TCAAATTAGA TAAAAATCAT ACACATACTA TAGAAATTGT TATTGACAGG CTAATCAAAA AACCAGGTAT AGAAGAACGT TTAGCAGATT CTTTAAATAC TTGTCTACGT CAATCTACTG GAATTGCTTT GATTAAAGTA TTAAATAATA CATCAGCATA CACAGGGGCA GTACCTACAA AAAATAAGTA TAACCTAAAT CCGGAAAAAA TAGCAACTTC AGCTAATTCT AGATCTGGGA ATAATTCAGC TAATTCTAGT CAGGAACTAG AAGTAGAAAT GGAAAAGTTT GAGACTCAAA TAGTCTTTTC GGAAAATTTT GCTTGTCCGG AGCATGGAGC CGTGATGGAG GAGTTGTCGC CCCGGCTGTT TTCTTTCAAT TCTCCTTATG GCGCTTGTCC GACTTGTCAC GGCTTGGGTA GTCTAAAGCA ATTTTCCCCG GAGTTGATAG TACCAGACCC TAATGCACCT TTATATTCAG CGATCGCTCC CTGGTCGAAC AAAGAAAATC CTTATTATTT TTCTCTGCTT TATAGTTTAG CTGAAGCTTA TGATTTTGAT ATAGAAACTC CCTGGAATAA ATTAAGTAAA AAAGAGCAGA AGTTAGTGCT TGAAGGTAGC GACGAACCTA TTTGGATAGA AATGAAAAAT GGAGAAGGAG ATTATCGTTA CTATCCTGGA GTTATCCCTA CTTTAGAAAA GCAATATAAA GAAACAGGTT CAGATTTAAT GAAACAAAAA TTAGAGCAAT ATTTAATTAA TCAAACCTGT GAAACCTGCC AAGGAAAAAG ATTAAAACCA GAAGCACTTT CCGTAGAAAT AGGGCAATAT AGAATTACTG ATTTTACAGA AGTTTCAATT CGAGAATGTT TGGAAAAAAT TAATAGCTTA CAACTGAGTG ATCGCCAGGC AAAAATAGCA GAATTAGTAT TGAAAGAAAT TCGAGCCAGA CTAAATTTTC TTCTAGATGT TGGCTTAGAT TATTTAACAT TAGACCGAGC AACAATGACA CTTTCTGGAG GAGAAGCTCA AAGAATTAGA TTAGCAACAC AAATTGGTTC TGGCTTAACA GGAGTTCTCT ATGTTTTAGA CGAACCAAGT ATTGGTTTGC ATCAAAGAGA TAATAATCGT CTGTTGCAAA CTTTAAGCAA ACTTCGCGAT TTAAAAAATA CATTAATAGT TGTGGAACAT GATGAGGAAA CTATTAAAGC AGCCGACCAT ATTATTGATA TTGGTCCGGG TGCGGGAGTT CATGGCGGGC GGATAATTTC TCAGGGAAAT TTTCAGACAT TATTAGAAAC GGAAGAGTCA TTAACTGGTG CTTATTTATC TGGCAAAAAA AATATTACTA CTCCATCTGA AAGAAGAGGA GGAAATGGAA AATCTTTACT TTTGAATAAT TGTCATCGAA ACAATCTCAA AAATATAGAT ATAGAGATTC CTTTGGGAAA ACTTGTCTGT ATTACTGGGG TTTCTGGTTC AGGAAAATCG ACCTTAATGA ACGAATTAAT TTATCCAGCT TTGCAACATT ATCTCAGTCG TAATGTTCCT TTTCCTAAAC ATTTAGAAAA AATTAAAGGA TTAAAAGCAA TAGATAAAGT AATAGTAATT GACCAATCAC CTATCGGCAG AACTCCCCGT TCAAATCCTG CAACTTATAC AGGAGTATTT GATGTAATTC GAGGAATATT TGCAGAAACT ATAGAAGCAA AAGCTAGAGG TTATAAGCCA GGGCAATTTT CTTTTAATGT TAAAGGTGGC AGATGTGAAG CTTGTAGCGG ACAAGGTGTA AATGTAATTG AAATGAATTT TTTGCCAGAT GTTTATGTAC AATGTGAGGT TTGTAAGGGT GCAAGATATA GTAGAGAAAC TTTGCAGGTG AGATATAAAG ATAAGTCAAT TGCTGATGTT TTAGATATGA CTGTAGAGGA AGGTTTGGAA ATATTTAAAA ATATTCCCAG GGCAGCAAGT AGATTACAAA CTTTAGTGGA TGTGGGATTA GGTTATATCA AATTAGGTCA GCCTGCACCG ACTCTTTCTG GAGGAGAAGC ACAAAGAGTA AAATTAGCTT CTGAATTGTC TAAGAGAGCA ACGGGAAAAA CTATTTATTT GATAGATGAA CCAACAACTG GTTTATCATT TTATGATGTT CATCAGTTAT TAAATGTTTT GCAAAGATTG GTAGATAAAG GAAATTCAAT TGTAGTAATT GAGCATAATT TAGATGTGAT TCGTTGTGCC GACTGGGTAC TAGATCTAGG CCCGGAAGGA GGAGATAAAG GAGGAGAAAT TATTGTTTGT GGAACCCCTG AAGAGGTGGC AGATAATTTT GAGTCTTATA CTGGAAAATA TTTGCGGGAG GTATTGGAAA AGTATCCACC TGAAGCTGAA AAAATTGATA TTTAA
|
Protein sequence | MTNIQNGHHS YPSNENTIRI RGARQHNLKN INLDLPRDRL IVFTGVSGSG KSSLAFDTIF AEGQRRYVES LSAYARQFLG QLDKPDVDFI EGLSPAISID QKSTSHNPRS TVGTVTEIYD YLRLLFGRAG EPHCPICHHN IAPQTIDEMC DRVMALPDRT KFYILAPVVR GKKGTHKKLL SSLAAQGFVR LRVDGEVVEI AENIKLDKNH THTIEIVIDR LIKKPGIEER LADSLNTCLR QSTGIALIKV LNNTSAYTGA VPTKNKYNLN PEKIATSANS RSGNNSANSS QELEVEMEKF ETQIVFSENF ACPEHGAVME ELSPRLFSFN SPYGACPTCH GLGSLKQFSP ELIVPDPNAP LYSAIAPWSN KENPYYFSLL YSLAEAYDFD IETPWNKLSK KEQKLVLEGS DEPIWIEMKN GEGDYRYYPG VIPTLEKQYK ETGSDLMKQK LEQYLINQTC ETCQGKRLKP EALSVEIGQY RITDFTEVSI RECLEKINSL QLSDRQAKIA ELVLKEIRAR LNFLLDVGLD YLTLDRATMT LSGGEAQRIR LATQIGSGLT GVLYVLDEPS IGLHQRDNNR LLQTLSKLRD LKNTLIVVEH DEETIKAADH IIDIGPGAGV HGGRIISQGN FQTLLETEES LTGAYLSGKK NITTPSERRG GNGKSLLLNN CHRNNLKNID IEIPLGKLVC ITGVSGSGKS TLMNELIYPA LQHYLSRNVP FPKHLEKIKG LKAIDKVIVI DQSPIGRTPR SNPATYTGVF DVIRGIFAET IEAKARGYKP GQFSFNVKGG RCEACSGQGV NVIEMNFLPD VYVQCEVCKG ARYSRETLQV RYKDKSIADV LDMTVEEGLE IFKNIPRAAS RLQTLVDVGL GYIKLGQPAP TLSGGEAQRV KLASELSKRA TGKTIYLIDE PTTGLSFYDV HQLLNVLQRL VDKGNSIVVI EHNLDVIRCA DWVLDLGPEG GDKGGEIIVC GTPEEVADNF ESYTGKYLRE VLEKYPPEAE KIDI
|
| |