Gene TM1040_2520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2520 
SymboluvrC 
ID4076522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2659370 
End bp2661277 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content59% 
IMG OID638007844 
Productexcinuclease ABC subunit C 
Protein accessionYP_614514 
Protein GI99082360 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAGA ATCATATGTC AGAGACTATG AACGACATCA GCGCAGAATC GCCCGACCAA 
CCCGAGCCCC CCCGCACCGG ATACGGTGTC ATTCAGGCAT ACCTGAAGAC GCTTGATTCA
TCGCCGGGCG TGTACCGTAT GCTCGATCAC GAAAGCAGGG TGCTCTATGT CGGCAAGGCG
CGCAATCTGC GGGCGCGGGT GTCCAATTAC ACCCGCCCGG GACATACGCA GCGCATCGAG
ACGATGATCT CACAAACCAG CCGCATGATG TTCCTGACAA CGCGCACCGA AACCGAGGCG
CTCTTGTTGG AGCAAAACCT CATCAAGCAG CTGAAGCCCA AATACAATGT GCTGCTGCGC
GACGATAAAA GTTTTCCCTA CATCATGGTG AGCAAGAACC ATGCTTTTCC GCAGCTGAAA
AAGCACCGCG GCGCACGCAA GGGGAAGGCG AGCTTCTTTG GGCCCTTTGC CAGTGCCGGG
GCGGTGAACC GGACGTTGAA TCAGTTGCAA AAAGCGTTCT TGCTGCGCAA CTGCACAGAC
ACCATGTTTG AAAACCGCAC CCGGCCCTGC CTGCAGTATC AGATCAAACG CTGCTCTGGC
CCATGTGTGG GAAAGATCTC GCAGGCGGAT TACGCCGACA GTGTCCGGGA TGCAGAGCGG
TTTCTGGCGG GACGCTCCAC AAAGATCCAG GAAGAGCTTG GCGCTGAGAT GCAAGCCGCC
TCGGAAGCGA TGGAATATGA GCGCGCGGCA GCCTTGCGGG ACCGGATCAA GGCGCTGACG
CAGGTGCAAT CGGCGCAGGG CATCAACCCG CGTGGCGTGT CCGAGGCTGA CATCATTGGC
CTGCATTTGG AAAACGGGCT GGCCTGCGTG CAGGTGTTTT TTATTCGCGC CAATCAGAAC
TGGGGCAATC AGGACTTCTA CCCGCGCGTG GCCGAGGATA TGTCCGCCGC CGAAGTCATG
GAGGCTTTCA TTGGCCAGTT CTATGACAAC AAGGATGTTC CACGTCAGCT CATCTTGTCG
GATGACATCG AAAACGCAGA TCTGATGGCT GTGGCACTCA GTGAGAAAGC CCGGCGCAAG
GTGGAAATCG TGGTGCCCCA GCGGGGCGAG AAGACCGAGC TTGTGGCCTC GGCTGTGCGC
AATGCCCGTG AAAGCCTCGC TCGCCGGATG TCCGAGAGCG CCACACAGGC CAAACTTTTG
CGCGGCATTG CTGATGCTTT TGGGCTGGAA GCTCCGCCAA ACCGCATCGA GGTTTACGAC
AACTCTCACA TTCAGGGCAC CAACGCCGTC GGTGGCATGA TCGTCATGGG GCCTGAGGGC
TTTATGAAAA ACGCCTATCG TAAGTTCAAC ATCAAGGATG GTGAGGTCAT TGCAGGCGAT
GACTTTGGCA TGATGAAGGC GGTGCTGAAC CGCCGCTTCT CCCGCCTGTT GAAAGAAGAC
CCCGACCGCC AAAAGGGCAT GTGGCCGGAT CTTCTGCTCA TTGACGGCGG TGCGGGGCAG
GTATCGGCCG TGGCCGAGAT CATGGAGGAG CATGGCGTGC AGGACATTCC CATGGTCGGG
GTGGCCAAGG GTGTCGATCG CGACCATGGC AAGGAGGAGT TCTACCGCCC CGGCGAAAAC
GCCTTTGCGC TGCAACGCAA TGATCCTGTG CTTTACTTCA TTCAACGCAT GCGCGACGAG
GCGCACCGGT TTGCCATCGG CACCCACCGG GCCAAGCGGG CAAAATCTCT TGTGGCCAAT
CCATTGGATG ACATTCCCGG CGTCGGCGCG CGTCGCAAGA AGGCACTTCT GACGCATTTT
GGCAGCGCCA AGGCGGTGAG CCGCGCGAAC CTGTCGGATC TCAAGGCGGT GGACGGCGTC
TCAGACGCGC TGGCGGAAAC GATCTACAAC TATTTTCAGG TGCGCTGA
 
Protein sequence
MAQNHMSETM NDISAESPDQ PEPPRTGYGV IQAYLKTLDS SPGVYRMLDH ESRVLYVGKA 
RNLRARVSNY TRPGHTQRIE TMISQTSRMM FLTTRTETEA LLLEQNLIKQ LKPKYNVLLR
DDKSFPYIMV SKNHAFPQLK KHRGARKGKA SFFGPFASAG AVNRTLNQLQ KAFLLRNCTD
TMFENRTRPC LQYQIKRCSG PCVGKISQAD YADSVRDAER FLAGRSTKIQ EELGAEMQAA
SEAMEYERAA ALRDRIKALT QVQSAQGINP RGVSEADIIG LHLENGLACV QVFFIRANQN
WGNQDFYPRV AEDMSAAEVM EAFIGQFYDN KDVPRQLILS DDIENADLMA VALSEKARRK
VEIVVPQRGE KTELVASAVR NARESLARRM SESATQAKLL RGIADAFGLE APPNRIEVYD
NSHIQGTNAV GGMIVMGPEG FMKNAYRKFN IKDGEVIAGD DFGMMKAVLN RRFSRLLKED
PDRQKGMWPD LLLIDGGAGQ VSAVAEIMEE HGVQDIPMVG VAKGVDRDHG KEEFYRPGEN
AFALQRNDPV LYFIQRMRDE AHRFAIGTHR AKRAKSLVAN PLDDIPGVGA RRKKALLTHF
GSAKAVSRAN LSDLKAVDGV SDALAETIYN YFQVR