Gene Tery_4529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4529 
SymboluvrA 
ID4246183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6986412 
End bp6989426 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content38% 
IMG OID638109406 
Productexcinuclease ABC subunit A 
Protein accessionYP_723982 
Protein GI113477921 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.388668 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAATA TCCAAAACGG ACATCATTCT TACCCCAGCA ATGAAAATAC CATCCGCATC 
CGAGGAGCCA GACAACATAA CCTCAAAAAC ATCAACCTAG ACCTACCACG CGATCGCCTC
ATAGTCTTCA CAGGAGTATC CGGTTCCGGC AAATCATCCC TCGCCTTCGA CACAATTTTT
GCCGAAGGGC AACGTCGCTA CGTCGAATCC CTCAGTGCCT ATGCCCGACA ATTTCTCGGA
CAGCTCGACA AACCAGATGT GGACTTCATC GAAGGATTAA GCCCTGCCAT TTCCATCGAC
CAAAAATCCA CATCCCATAA CCCTCGCTCC ACAGTTGGAA CCGTCACCGA AATTTACGAC
TATCTAAGAC TACTATTTGG TAGAGCCGGA GAACCCCATT GCCCTATCTG CCACCATAAT
ATTGCCCCCC AAACTATTGA TGAAATGTGC GACCGAGTCA TGGCTTTGCC AGACCGCACT
AAATTTTATA TTTTAGCCCC AGTGGTTCGA GGCAAAAAAG GGACTCATAA GAAACTTTTG
TCCAGTTTAG CTGCTCAAGG ATTTGTTCGT TTGAGAGTTG ATGGAGAGGT TGTAGAAATT
GCTGAAAACA TCAAATTAGA TAAAAATCAT ACACATACTA TAGAAATTGT TATTGACAGG
CTAATCAAAA AACCAGGTAT AGAAGAACGT TTAGCAGATT CTTTAAATAC TTGTCTACGT
CAATCTACTG GAATTGCTTT GATTAAAGTA TTAAATAATA CATCAGCATA CACAGGGGCA
GTACCTACAA AAAATAAGTA TAACCTAAAT CCGGAAAAAA TAGCAACTTC AGCTAATTCT
AGATCTGGGA ATAATTCAGC TAATTCTAGT CAGGAACTAG AAGTAGAAAT GGAAAAGTTT
GAGACTCAAA TAGTCTTTTC GGAAAATTTT GCTTGTCCGG AGCATGGAGC CGTGATGGAG
GAGTTGTCGC CCCGGCTGTT TTCTTTCAAT TCTCCTTATG GCGCTTGTCC GACTTGTCAC
GGCTTGGGTA GTCTAAAGCA ATTTTCCCCG GAGTTGATAG TACCAGACCC TAATGCACCT
TTATATTCAG CGATCGCTCC CTGGTCGAAC AAAGAAAATC CTTATTATTT TTCTCTGCTT
TATAGTTTAG CTGAAGCTTA TGATTTTGAT ATAGAAACTC CCTGGAATAA ATTAAGTAAA
AAAGAGCAGA AGTTAGTGCT TGAAGGTAGC GACGAACCTA TTTGGATAGA AATGAAAAAT
GGAGAAGGAG ATTATCGTTA CTATCCTGGA GTTATCCCTA CTTTAGAAAA GCAATATAAA
GAAACAGGTT CAGATTTAAT GAAACAAAAA TTAGAGCAAT ATTTAATTAA TCAAACCTGT
GAAACCTGCC AAGGAAAAAG ATTAAAACCA GAAGCACTTT CCGTAGAAAT AGGGCAATAT
AGAATTACTG ATTTTACAGA AGTTTCAATT CGAGAATGTT TGGAAAAAAT TAATAGCTTA
CAACTGAGTG ATCGCCAGGC AAAAATAGCA GAATTAGTAT TGAAAGAAAT TCGAGCCAGA
CTAAATTTTC TTCTAGATGT TGGCTTAGAT TATTTAACAT TAGACCGAGC AACAATGACA
CTTTCTGGAG GAGAAGCTCA AAGAATTAGA TTAGCAACAC AAATTGGTTC TGGCTTAACA
GGAGTTCTCT ATGTTTTAGA CGAACCAAGT ATTGGTTTGC ATCAAAGAGA TAATAATCGT
CTGTTGCAAA CTTTAAGCAA ACTTCGCGAT TTAAAAAATA CATTAATAGT TGTGGAACAT
GATGAGGAAA CTATTAAAGC AGCCGACCAT ATTATTGATA TTGGTCCGGG TGCGGGAGTT
CATGGCGGGC GGATAATTTC TCAGGGAAAT TTTCAGACAT TATTAGAAAC GGAAGAGTCA
TTAACTGGTG CTTATTTATC TGGCAAAAAA AATATTACTA CTCCATCTGA AAGAAGAGGA
GGAAATGGAA AATCTTTACT TTTGAATAAT TGTCATCGAA ACAATCTCAA AAATATAGAT
ATAGAGATTC CTTTGGGAAA ACTTGTCTGT ATTACTGGGG TTTCTGGTTC AGGAAAATCG
ACCTTAATGA ACGAATTAAT TTATCCAGCT TTGCAACATT ATCTCAGTCG TAATGTTCCT
TTTCCTAAAC ATTTAGAAAA AATTAAAGGA TTAAAAGCAA TAGATAAAGT AATAGTAATT
GACCAATCAC CTATCGGCAG AACTCCCCGT TCAAATCCTG CAACTTATAC AGGAGTATTT
GATGTAATTC GAGGAATATT TGCAGAAACT ATAGAAGCAA AAGCTAGAGG TTATAAGCCA
GGGCAATTTT CTTTTAATGT TAAAGGTGGC AGATGTGAAG CTTGTAGCGG ACAAGGTGTA
AATGTAATTG AAATGAATTT TTTGCCAGAT GTTTATGTAC AATGTGAGGT TTGTAAGGGT
GCAAGATATA GTAGAGAAAC TTTGCAGGTG AGATATAAAG ATAAGTCAAT TGCTGATGTT
TTAGATATGA CTGTAGAGGA AGGTTTGGAA ATATTTAAAA ATATTCCCAG GGCAGCAAGT
AGATTACAAA CTTTAGTGGA TGTGGGATTA GGTTATATCA AATTAGGTCA GCCTGCACCG
ACTCTTTCTG GAGGAGAAGC ACAAAGAGTA AAATTAGCTT CTGAATTGTC TAAGAGAGCA
ACGGGAAAAA CTATTTATTT GATAGATGAA CCAACAACTG GTTTATCATT TTATGATGTT
CATCAGTTAT TAAATGTTTT GCAAAGATTG GTAGATAAAG GAAATTCAAT TGTAGTAATT
GAGCATAATT TAGATGTGAT TCGTTGTGCC GACTGGGTAC TAGATCTAGG CCCGGAAGGA
GGAGATAAAG GAGGAGAAAT TATTGTTTGT GGAACCCCTG AAGAGGTGGC AGATAATTTT
GAGTCTTATA CTGGAAAATA TTTGCGGGAG GTATTGGAAA AGTATCCACC TGAAGCTGAA
AAAATTGATA TTTAA
 
Protein sequence
MTNIQNGHHS YPSNENTIRI RGARQHNLKN INLDLPRDRL IVFTGVSGSG KSSLAFDTIF 
AEGQRRYVES LSAYARQFLG QLDKPDVDFI EGLSPAISID QKSTSHNPRS TVGTVTEIYD
YLRLLFGRAG EPHCPICHHN IAPQTIDEMC DRVMALPDRT KFYILAPVVR GKKGTHKKLL
SSLAAQGFVR LRVDGEVVEI AENIKLDKNH THTIEIVIDR LIKKPGIEER LADSLNTCLR
QSTGIALIKV LNNTSAYTGA VPTKNKYNLN PEKIATSANS RSGNNSANSS QELEVEMEKF
ETQIVFSENF ACPEHGAVME ELSPRLFSFN SPYGACPTCH GLGSLKQFSP ELIVPDPNAP
LYSAIAPWSN KENPYYFSLL YSLAEAYDFD IETPWNKLSK KEQKLVLEGS DEPIWIEMKN
GEGDYRYYPG VIPTLEKQYK ETGSDLMKQK LEQYLINQTC ETCQGKRLKP EALSVEIGQY
RITDFTEVSI RECLEKINSL QLSDRQAKIA ELVLKEIRAR LNFLLDVGLD YLTLDRATMT
LSGGEAQRIR LATQIGSGLT GVLYVLDEPS IGLHQRDNNR LLQTLSKLRD LKNTLIVVEH
DEETIKAADH IIDIGPGAGV HGGRIISQGN FQTLLETEES LTGAYLSGKK NITTPSERRG
GNGKSLLLNN CHRNNLKNID IEIPLGKLVC ITGVSGSGKS TLMNELIYPA LQHYLSRNVP
FPKHLEKIKG LKAIDKVIVI DQSPIGRTPR SNPATYTGVF DVIRGIFAET IEAKARGYKP
GQFSFNVKGG RCEACSGQGV NVIEMNFLPD VYVQCEVCKG ARYSRETLQV RYKDKSIADV
LDMTVEEGLE IFKNIPRAAS RLQTLVDVGL GYIKLGQPAP TLSGGEAQRV KLASELSKRA
TGKTIYLIDE PTTGLSFYDV HQLLNVLQRL VDKGNSIVVI EHNLDVIRCA DWVLDLGPEG
GDKGGEIIVC GTPEEVADNF ESYTGKYLRE VLEKYPPEAE KIDI