Gene TDE1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTDE1010 
SymboluvrA 
ID2740990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTreponema denticola ATCC 35405 
KingdomBacteria 
Replicon accessionNC_002967 
Strand
Start bp1035367 
End bp1038231 
Gene Length2865 bp 
Protein Length954 aa 
Translation table11 
GC content46% 
IMG OID637159884 
Productexcinuclease ABC subunit A 
Protein accessionNP_971619 
Protein GI42526521 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAATA AAAATCAAAG CGGACACCTA AATAAACTCA TCGTAAAGGG TGCCCGTGAA 
CATAATTTGA AGAACATTGA TGTGGAGCTT CCCCGTGATA AGCTCATCGT TATATCCGGT
CTTTCAGGAT CGGGAAAAAG CTCCCTCGCC TTTGACACTA TTTTTGCCGA GGGGCAGCGC
CGTTATGTAG AATCCCTTTC GGCTTATGCA CGTCAGTTTT TAGGCAGAAT GGATAAGCCC
GATGTGGATT ACATCGAGGG GCTTTCCCCC GCTATCTCTA TAGAACAAAA AACTACGCAC
CGCAACCCCC GCTCCACGGT CGGAACGGTA ACCGAAATTT ATGACTACTA CCGCCTCCTT
TTTGCCCGTA CCGGCCATGC CCATTGCCCT TCTTGCGGAA AGGAGATTAA GGAGCAGACG
GTTGACCAGA TTATAGATAC GATTATGAGC TGGCCGGAAG GAACCCGCGT TCAGATTCTC
GCTCCCATTA TAAAGGGAAA AAAGGGAGAA CACCAAAAGA TAGTGAGCGA TGCAATAGCC
GCCGGTTTTG TGCGTGCCCG CATAGACGGC CTTCTTGTAA ACCTCGAAGA CGGGGTAAAG
CTTGATAAAC AAAAAAAACA CACAATCGAA ATAATCGTAG ACCGAATTCA GCTGTCGAAG
GATGTGCGGA AACGCCTTTC GGAATCGGTG GAAACGGCCT TGGAAAGTTC CGGCGGAACC
CTGCTTGCTA CAAGACAGGA CGATAAGGAT TCTCCCGTAA CCGAGGTTTT CTTTTCGCAA
AAAAATGCCT GTTCTGATTG CGGCATTTCG ATGCCCGAAT TGCAGCCCCG CCTTTTTTCT
TTTAATAACC CGATAGGGGC CTGCCCCGAA TGTACGGGAC TCGGAATGAC TCAGCACTTT
GACCAAGACC TGATAGCCCC CGATAAAAGC CTTTCGTTTA ATGAGGGCGT TTTCGTTCCG
TATAATCCCG AATCCGATTG GAACAGGGTG CGGTTTGAAG CCCTCGCCGC CCAATTCGAT
TTTTCTCTTG ATACGCCATT GAACAAGCTC CCCAAAAAAA TACAGACCAT CATTTGGGAA
GGTTCGGGCG ATACAAAGAT TCAGTTTTCT TATACGTCCA AAAGCGGATC GGGGAAATAT
TCTTATAACC GCCCTTGGCC GGGAATAATG GCCGACATGA ACCGGAAATA TAACGAATCT
TACTCGGCTT CTATCAGAGA ATATTATGAA AAGTTTATGT CGATAAAGCC CTGCAAAACC
TGCGGAGGAA TGAGGCTTAA ACCCGAAGTG CTGGCCGTAA CGGTGGGCAA TAAAAACATT
CATGACCTTA CCTGCCTTTC CGTCGGGGAT TCTATCGAGT TTTTTGAAAA ATTGAAGCTG
ACTGAAACGG AAGAACACAT AGCCTTTCAG ATTTTAAAAG AAATTAAGGC CCGCTTGGAA
TTTATGAAAA ACGTCGGCCT TGACTATTTA ACCTTAGAAA GAAAGGCTGC AACCCTTTCA
GGCGGCGAGG CTCAACGCAT CAGGCTTGCG ACCCAGATAG GTTCAAGTTT GATAGGTGTT
CTATATATTT TGGATGAGCC TTCGATAGGG CTTCATCAAC GGGATAATCA AAGGCTGATT
GATACTCTTT TGTACTTGCG TAATTTGGGG AATACCCTCA TCGTTGTAGA ACATGACGAG
CAAACCCTCC GCACGGCCGA CTACATTGTC GACCTCGGTC CGGGTGCAGG CGTTCACGGG
GGAAACATAA CTGCCCAAGG TACGCCTGAT GAAGTTGCAA AAGTAAAAAA CAGTTTAACG
GGGCAATATC TTGCAGGTAC GCTTAAAATG GATATTCCTA AAGAAAGGCG GAAAGGTAAC
GGAAATGCTT TGGAGCTTTC AGGGGTGAGC GAGCATAATC TAAAAGATGT TTCTATAAAA
ATTCCTTTGG GAGCCTTTAC CTGTATTACC GGAGTTTCGG GCTCGGGAAA ATCAACCCTT
TTAACCGATG TTCTATATCC GGCAGTTTCA AACAAGATTA TGCGTTCCTC TATTCCTGAA
GGGGCCTATA AAAAACTGAG CGGCCTTGAG CACATAGACA AGGTTATCAA TATCGATCAA
AGCCCCATCG GGAGAACACC ACGCTCAAAC CCTGCGACCT ATGTAGGCGT TTTTACGGGG
ATAAGGGACT TGTTTGCAAG CCTTCCCGAA TCGAAGGCGA GGGGTTATAA GCCCGGCCGC
TTTTCGTTTA ATGTAAGGGG CGGAAGGTGC GAGCATTGTC AGGGCGACGG AACCCTCACA
ATCGAGATGA ACTTTTTGCC CGATGTTTAT ATAGCCTGCG ATGTTTGCCG AGGGAAACGG
TTTAACAAAG AGACCCTCGA TGTCCGCTAT AAGGGAAAAA ACATTGCCGA TGTTTTGGAT
ATGACCATCG AGGAGGCTTC GGAATTTTTT GCTCCCATTC CTCACATTGC CCGAAAACTT
CAAACCCTCT TATCGGTCGG TTTGGGTTAT ATAAAATTAG GCCAATCGGC TCTTACTCTT
TCTGGAGGAG AAGCCCAACG GGTAAAACTT GCAAACGAAC TTGCCAAACG TTCTACAGGT
AAGACCCTCT ACATTTTGGA TGAACCTACA ACGGGCTTAC ACTTTGCGGA TGTCAAGCAG
CTCATGCAGG TTATTCATCG CCTCATAGAT CAGGGGAACA CCGTTATTAT GATAGAACAC
AACCTTGATG TTATCTTACA GGCCGATAAG ATTATCGACC TCGGCCCCGA AGGCGGAACC
AACGGAGGGC AAATCATAGC GGAAGGAACA CCTGAAGAAG TAGCAAAGAT AAAAAAATCC
TATACGGGAT ATTATATAAA GGAAATGCTT GAAAGAGTCA GGTAA
 
Protein sequence
MNNKNQSGHL NKLIVKGARE HNLKNIDVEL PRDKLIVISG LSGSGKSSLA FDTIFAEGQR 
RYVESLSAYA RQFLGRMDKP DVDYIEGLSP AISIEQKTTH RNPRSTVGTV TEIYDYYRLL
FARTGHAHCP SCGKEIKEQT VDQIIDTIMS WPEGTRVQIL APIIKGKKGE HQKIVSDAIA
AGFVRARIDG LLVNLEDGVK LDKQKKHTIE IIVDRIQLSK DVRKRLSESV ETALESSGGT
LLATRQDDKD SPVTEVFFSQ KNACSDCGIS MPELQPRLFS FNNPIGACPE CTGLGMTQHF
DQDLIAPDKS LSFNEGVFVP YNPESDWNRV RFEALAAQFD FSLDTPLNKL PKKIQTIIWE
GSGDTKIQFS YTSKSGSGKY SYNRPWPGIM ADMNRKYNES YSASIREYYE KFMSIKPCKT
CGGMRLKPEV LAVTVGNKNI HDLTCLSVGD SIEFFEKLKL TETEEHIAFQ ILKEIKARLE
FMKNVGLDYL TLERKAATLS GGEAQRIRLA TQIGSSLIGV LYILDEPSIG LHQRDNQRLI
DTLLYLRNLG NTLIVVEHDE QTLRTADYIV DLGPGAGVHG GNITAQGTPD EVAKVKNSLT
GQYLAGTLKM DIPKERRKGN GNALELSGVS EHNLKDVSIK IPLGAFTCIT GVSGSGKSTL
LTDVLYPAVS NKIMRSSIPE GAYKKLSGLE HIDKVINIDQ SPIGRTPRSN PATYVGVFTG
IRDLFASLPE SKARGYKPGR FSFNVRGGRC EHCQGDGTLT IEMNFLPDVY IACDVCRGKR
FNKETLDVRY KGKNIADVLD MTIEEASEFF APIPHIARKL QTLLSVGLGY IKLGQSALTL
SGGEAQRVKL ANELAKRSTG KTLYILDEPT TGLHFADVKQ LMQVIHRLID QGNTVIMIEH
NLDVILQADK IIDLGPEGGT NGGQIIAEGT PEEVAKIKKS YTGYYIKEML ERVR