Gene TM1040_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1098 
SymboluvrA 
ID4077805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1177570 
End bp1180461 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content62% 
IMG OID638006402 
Productexcinuclease ABC subunit A 
Protein accessionYP_613093 
Protein GI99080939 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGT TGAAGAACAT TGAGGTGCGC GGCGCGCGCG AGCACAATCT CAAGAACATC 
GACGTGGACA TCCCGCGGGA TGAACTGGTG GTGATCACGG GCCTCTCCGG TTCCGGCAAG
TCGAGCCTTG CCTTTGACAC CATCTACGCC GAGGGGCAGC GCCGCTATGT CGAATCCCTC
AGCGCCTATG CGCGCCAGTT CCTCGATATG ATGGAAAAGC CCGATGTGGA TCATATCTCG
GGCCTGTCGC CTGCGATCTC CATCGAGCAG AAGACCACAT CGAAGAACCC GCGCTCGACC
GTCGGCACCG TCACCGAGAT CTATGATTAT CTGCGACTGC TGTTTGCCCG CGCGGGCACG
CCCTACAGCC CTGCCACCGG CCAGCCCATC GAGGCGCAGC AGGTGCAGGA TATGGTCGAC
CGGATCATGA CGCTTGAGGA GGGCACGCGG GGCTATCTGC TGGCGCCCAT CGTGCGCGAC
CGCAAGGGCG AGTACAAAAA AGAGATGCTG GAGCTCAGGA AACAGGGGTT CCAGCGCGTG
AAGGTGGACG GGGAGTTTCA CGATCTTGAT ACGCCGCCCA CGCTCGACAA GAAGTTCCGC
CATGACATCG ACGTGGTGGT CGACCGGCTG GTTGTAAAAG AGGGCATCGA GACCCGGCTG
GCGGACAGCT TGCGCACTGC GCTGGATCTG GCCGACGGGA TCGCCATTCT GGAGACCGCC
CCGCGTGAGG GCGACCCGGA GCGGATCACC TTTTCCGAGA ATTTCGCCTG TCCCGTCAGT
GGCTTCACCA TCCCCGAGAT CGAACCACGG CTGTTTTCCT TTAACGCGCC CTTTGGGGCC
TGTCCGCATT GTGACGGTCT GGGCGTGGAG CTGTTCTTTG ACGAACGTCT CGTGGTACCG
GATCAGAGCC TCAAGGTTTA TGACGGGGCG CTGGCGCCCT GGCGCAAGGG CAAATCGCCC
TATTTCCTGC AGACTATCGA GGCCATCGCC AACCACTATG AGTTCGACAA GAACACGCCG
TGGAAGGATC TGCCCGCGCA TGTGAAACAG GTGTTCCTGC ATGGCTCCGG CGACGAGGAA
ATCGCCTTTC GCTATGACGA AGGCGGGCGC GTCTACAATG TGACGCGCGT CTTTGAGGGC
GTGATCCCCA ATATGGAGCG CCGCTACCGC GAAACGGATT CCAACTGGGT GCGCGAGGAG
TTCGAGCGCT ACCAGAACAA CCGCGACTGC GGCCATTGTG GTGGGTATCG TCTGCGCGAA
GAGGCGCTGG CGGTCAAAAT CGGCCCGGCG GGAGGCCCCG CCGAGCATCG TCTGCATGTG
GGGCAGGTGG TGGAGAAATC CATCCGCGAG GCGCTGGCGT GGATCGAAGA GGTGCCGAGC
CATCTCAGCC CGCAAAAGCA GGAGATCGCC CGCGCCATCG TCAAGGAAAT CCGCGAGCGT
CTTGGGTTCC TCAACAATGT GGGGCTTGAG TATCTGACCC TCTCGCGCAA CGCGGGCACG
CTCTCGGGCG GGGAAAGCCA GCGGATTCGT CTGGCGAGCC AGATCGGCTC TGGCCTGACC
GGGGTGCTCT ATGTGTTGGA CGAGCCCTCC ATCGGCCTGC ACCAGCGTGA CAATGACCGG
CTGATCACCA CGCTCAAGAA CCTGCGCGAT CAGGGCAACA CGGTGATCGT GGTGGAACAT
GACGAGGATA TGATCCGGCA GGCCGATTAC GTCTTTGATA TTGGTCCCGG CGCCGGGGTG
CACGGCGGGC AGGTTGTCAG CCACGGCACG CCCGCCACGG TTGAGGGTGA TGCGGGTTCG
GTCACCGGTC AGTATCTGGC CGGAACGCGT GAGATTGCGG TGCCGGATAC GCGCCGCAAG
GGCAACAAGA AGAAGATCAA GGTGGTGAAG GCCTCCGGCA ACAACCTGAA GGACGTCACC
GCCGAATTCC CGCTGGGGAA ATTTGTCTGC GTCACCGGTG TGTCGGGCGG TGGCAAGTCC
ACGCTGACCA TCGAGACGCT GTTCAAGACT GCCTCGATGC GTCTCAACGG GGCGCGCCAG
ACGCCTGCGC CTTGCGAAAC CATCAAGGGG CTCGAGCATC TGGACAAGGT GATCGACATC
GACCAGCGCC CCATCGGGCG CACGCCACGC TCGAACCCGG CGACCTATAC CGGGGCCTTC
ACGCCGATCC GCGACTGGTT TGCCGGCCTG CCCGAAGCCA AGGCGCGCGG GTATAAACCG
GGGCGGTTTT CCTTTAACGT GAAGGGCGGA CGCTGCGAGG CCTGTCAGGG TGACGGGGTG
CTGAAGGTCG AGATGCATTT CCTGCCCGAC GTCTATGTCA CCTGCGAGAC CTGTCAGGGC
GCGCGCTACA ACCGCGAGAC GCTGGAGATC AAGTTCAAGG GCAAGAGCAT TGCCGATGTG
CTGGATATGA CAGTGGAGGA TGCGCAGGAG TTCTTTGCCG CCGTGCCGAC GATCCGCGAC
AAGATGGACG CGCTGATGCG GGTGGGTCTT GGCTATATCA AGGTCGGCCA GCAGGCCACC
ACACTGTCGG GCGGCGAGGC CCAGCGGGTG AAACTCTCAA AGGAACTCGC CAAACGGTCG
ACGGGCCGCA CGCTTTATAT CCTGGATGAG CCGACCACCG GTCTGCATTT TGAGGATGTG
CGCAAGCTCT TGGAAGTTCT GCATGAATTG GTTGAGCAGG GCAATTCCGT GGTGGTGATC
GAACACAACC TCGACGTGAT CAAGACGGCT GATTGGCTGA TCGACATTGG CCCCGAAGGC
GGTGATGGCG GCGGCGAGAT TGTCGCTGTG GGGACGCCGG AAAAGGTCGC CGAAGAGCCG
CGCAGTCACA CCGGGCGCTA TCTGAAGCCG ATGCTGGAAG CGCAGGCGCG CAAGAAGGTC
GCGGCGGAGT GA
 
Protein sequence
MAELKNIEVR GAREHNLKNI DVDIPRDELV VITGLSGSGK SSLAFDTIYA EGQRRYVESL 
SAYARQFLDM MEKPDVDHIS GLSPAISIEQ KTTSKNPRST VGTVTEIYDY LRLLFARAGT
PYSPATGQPI EAQQVQDMVD RIMTLEEGTR GYLLAPIVRD RKGEYKKEML ELRKQGFQRV
KVDGEFHDLD TPPTLDKKFR HDIDVVVDRL VVKEGIETRL ADSLRTALDL ADGIAILETA
PREGDPERIT FSENFACPVS GFTIPEIEPR LFSFNAPFGA CPHCDGLGVE LFFDERLVVP
DQSLKVYDGA LAPWRKGKSP YFLQTIEAIA NHYEFDKNTP WKDLPAHVKQ VFLHGSGDEE
IAFRYDEGGR VYNVTRVFEG VIPNMERRYR ETDSNWVREE FERYQNNRDC GHCGGYRLRE
EALAVKIGPA GGPAEHRLHV GQVVEKSIRE ALAWIEEVPS HLSPQKQEIA RAIVKEIRER
LGFLNNVGLE YLTLSRNAGT LSGGESQRIR LASQIGSGLT GVLYVLDEPS IGLHQRDNDR
LITTLKNLRD QGNTVIVVEH DEDMIRQADY VFDIGPGAGV HGGQVVSHGT PATVEGDAGS
VTGQYLAGTR EIAVPDTRRK GNKKKIKVVK ASGNNLKDVT AEFPLGKFVC VTGVSGGGKS
TLTIETLFKT ASMRLNGARQ TPAPCETIKG LEHLDKVIDI DQRPIGRTPR SNPATYTGAF
TPIRDWFAGL PEAKARGYKP GRFSFNVKGG RCEACQGDGV LKVEMHFLPD VYVTCETCQG
ARYNRETLEI KFKGKSIADV LDMTVEDAQE FFAAVPTIRD KMDALMRVGL GYIKVGQQAT
TLSGGEAQRV KLSKELAKRS TGRTLYILDE PTTGLHFEDV RKLLEVLHEL VEQGNSVVVI
EHNLDVIKTA DWLIDIGPEG GDGGGEIVAV GTPEKVAEEP RSHTGRYLKP MLEAQARKKV
AAE