Gene TM1040_2142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2142 
SymboldnaG 
ID4076456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2247809 
End bp2249818 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content62% 
IMG OID638007462 
ProductDNA primase 
Protein accessionYP_614136 
Protein GI99081982 
COG category[L] Replication, recombination and repair 
COG ID[COG0358] DNA primase (bacterial type) 
TIGRFAM ID[TIGR01391] DNA primase, catalytic core 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.512504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTGC CCCCCGGTTT CCTTGATGAA TTGCGCACCC GCATCAGCCT GTCTGACGTT 
GTCGGGCGCA AGGTCATGTG GGATCAGCGC AAAAGCCAGC CGGGCAAGGG CGACATGTGG
TCTCCGTGCC CCTTTCATCA TGAAAAAACC GCTTCTTTTC ATGTGGATGA TCGCAAAGGA
TTTTACTACT GCTTTGGCTG TCACGCCAAA GGGGATGCGC TGAAATTCGT GCAGGAGACA
GAGAATGTCT CCTTTATGGA GGCGGTAGAG ATCCTTGCGG GCGAGGCCGG GCTGGAGATG
CCCAAGCGCG ACCCCAAAGC GGCTGAGAAA CAGGACCGTC GCACGCAGCT TGCTGAGGTG
ATGGAACAGG CGGTGAAATA CTTCCGCCTG CAACTCAAGA CTCAAGCCGC GAGTGAGGCG
CGCGCCTACC TCGCGCGTCG GGGGCTTGAT GCGGCAGCGC TGGACCGCTG GGAGATCGGC
TTTGCCCCCG ATGGCTGGCA GAACCTTTGG GATGCGCTCA AAGGCAAGGA CATCGCGGAT
GATCTGATCC TCGGTGCCGG GCTGGCCAAA CCCTCGAACA AGGGCAAGAA GCCCTATGAT
GTGTTTCGCA ACCGGATCAT GTTCCCGATC CGGAATGCGC GGGGCCGGGC GATCGCCTTT
GGCGGCCGGG CGATGGATCC CAATGACAAC GCCAAATACC TGAACTCGCC GGAAACCGAG
CTGTTTGATA AGGGGCGCAA TCTCTATAAC CACGGGCCCG CGCGCGCGGC CTCGGGACGG
GGCAGCACCT TGATCGTGGC CGAAGGCTAC ATGGATGTGA TCGCGCTGTC CGAGGCGGGG
TTCCAGTCGG CGGTGGCGCC CTTGGGTACA GCCGTCACCG ACAACCAGCT GCATCTTTTA
TGGCGCATGT CGGACGAGCC GGTGATTGCG CTGGATGGGG ATACGGCTGG TCTGCGCGCG
GCGATGCGGC TGATCGATAT GGCGCTGCCT CTGCTCGAGG CCGGGAAGTC CTTGCGCTTT
GCCATCATGC CCGAGGGCAA GGACCCAGAT GATCTTATTC GCGCCGAGGG ACCAGAGGCG
GTGCAGGCGG TGCTGGATCA GGCGATTCCG ATGGTACAGC TGCTCTGGCG CCGCGAGACC
GAAGGGCGGG TGTTTGACAG CCCCGAGCGC AAAGCGGCCC TGGACAAGTC CCTGCGTGAA
AAGATCAAAT TGATCCGTGA CCCTTCTATT CGCCTGCATT ACGCCAATGA TATCAAGGAG
CTTCGCTTTC AGCTCTTTCG CGGTCAGCGT GGCGGTGGCG GGCAGGGGGG CACCGGCTAT
GGCCAGCGGT ACGGCGGTTT CAAACCGGGC TCCCGCATGC AGGGTGGATT TCGCGACGGC
GCCCGCATGC CTTGGGGTGC GCCGCCCCCG CCCCGCAGCA CCACACGGGC CTCGATGGTG
GCCTCCTCCG AGGAACAGGA CTTTGCCCGC GTGCGCGAGG CGGTGATCCT CGCCACAGCA
ATCACCACGC CCACGGTGAT CCCGGAATTC GAAATCAATC TTGAGCGCAT GCAGTGTCAG
GACATGGAAC ATGCCCACCT GCGCGATCTG GTTCTGCGCT ATGGGATCGA GGCCCCGGAG
CGGTTGCGCG ACGAAATTGC CTATGCTCTT GGGCCGGACG CTCTTGAAAA CCTGCTGTCG
CTGCGCCATG TGGCTATTAG CCCCTGCGTG CGCAAACCTG GCGACCTGGA TATGGCCAGC
CTGACCCTGG CCGAGGAATT CGCAAAACTC GAGAGCATCG CGGGGTTGAA TGCTGAGCTG
GCGGAAGCCG AGGAAGATCT GGATGGTCTG GCGGATGAGG CCCTGACCTG GCGTTTGCGT
CAGGCGGCAG AGGCCCGCAA CCGGGCTGTG CGCAGCGAAA ATGAGGATAA GGCAACTTAC
GACGTGGGGG AGAACGGGGC GCGGCTCAAC CGGGATGAGC GCGATGCCTT CGGAGACCTG
CTCAAGCGAA TCGGTTTCAA CAGCAAATGA
 
Protein sequence
MSLPPGFLDE LRTRISLSDV VGRKVMWDQR KSQPGKGDMW SPCPFHHEKT ASFHVDDRKG 
FYYCFGCHAK GDALKFVQET ENVSFMEAVE ILAGEAGLEM PKRDPKAAEK QDRRTQLAEV
MEQAVKYFRL QLKTQAASEA RAYLARRGLD AAALDRWEIG FAPDGWQNLW DALKGKDIAD
DLILGAGLAK PSNKGKKPYD VFRNRIMFPI RNARGRAIAF GGRAMDPNDN AKYLNSPETE
LFDKGRNLYN HGPARAASGR GSTLIVAEGY MDVIALSEAG FQSAVAPLGT AVTDNQLHLL
WRMSDEPVIA LDGDTAGLRA AMRLIDMALP LLEAGKSLRF AIMPEGKDPD DLIRAEGPEA
VQAVLDQAIP MVQLLWRRET EGRVFDSPER KAALDKSLRE KIKLIRDPSI RLHYANDIKE
LRFQLFRGQR GGGGQGGTGY GQRYGGFKPG SRMQGGFRDG ARMPWGAPPP PRSTTRASMV
ASSEEQDFAR VREAVILATA ITTPTVIPEF EINLERMQCQ DMEHAHLRDL VLRYGIEAPE
RLRDEIAYAL GPDALENLLS LRHVAISPCV RKPGDLDMAS LTLAEEFAKL ESIAGLNAEL
AEAEEDLDGL ADEALTWRLR QAAEARNRAV RSENEDKATY DVGENGARLN RDERDAFGDL
LKRIGFNSK