Gene TM1040_3669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3669 
Symbol 
ID4075638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp721903 
End bp723342 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content60% 
IMG OID638005189 
ProductDNA polymerase III, epsilon subunit 
Protein accessionYP_611898 
Protein GI99078640 
COG category[L] Replication, recombination and repair 
COG ID[COG2176] DNA polymerase III, alpha subunit (gram-positive type) 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.731231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAGAT CCCTGTCCCT TCGCCTGCGC ATCTTCCTGT TCTTCTGCCT CCTGGCTGTG 
GGAGCAATCG CGCTGGCTGC GGTGGCGTTG GGTTTTGTCT GGACCCGCTC TGATTCAGAG
TGGACGGCTT CAGAACTCAC CACGGTGATC CTGTTGTTCG GGTTTTTGAA TACAGGTCTG
GTGCTGGGGA TCTGGCTGCT GTTTGACGAA AACGTCGCGC GCCCGATCGA GGGGCTTTCG
ACCAGTCTGC GCCTGCGTGC CCACTCCGGA ATTGAGGACA GCATCAAAGC ACAGGCAGCG
CAATACCTCG GCGATCTGGC CCCGGCGGCA CGCGCGCTGT CGGATGCGCT TGCGGCTTCT
GGAGATCAGC CACCAGAACA GACCCAGCGC CTGCTTCAGG AACGGGAGCG GCTGACTGCG
TTGCTCAGCG AAATTCCCAT TGCGACGATC CTTGTAAACG CAGCAGGTGA GATTGCACTC
TATGACGCGC AGGCCGGGGC CATCCTCTCC CGCATCGCCG CACCGCGCCT TGGGGCGCCA
CTCTCGAACT ATTTCGATCT GACGCCCGCA ATCGGCGCCT GTGCCCGAGC AGACCGGCGC
CTCGTAGCGG GGCATGTGAC CGTCCCAGAC TGCAATCATG CCGAAACGTT CAATCTGCAG
ATAAAATCGC TCGGTGCGGA GGGCGATGTG ATCTTCCTTG AAGCCGATAC GCGGGCGGAG
CGCGACACGC TGTCCCCTCC TTTGGTCTTT GATTTTGACC TTCTGGATCA GAAACTTGAG
GGGACGATAA CGGCCAGACG CCTCTCGGAG CTGAACTTCG TGGTCTTTGA CACCGAGACG
ACCGGGTTGT CGGTTACCAA GGATGCCATC GTGGAACTCG CCGCTGCCCG TGTGCTGAAC
GGGCGCATCC TCGACGGCGA GGTCTTTGAA ACCTATGTCG ATCCAGGCCG ACCGATCCCC
GCAGCGTCCA CCAAGATACA TGGCGTACGC GATGCGGATG TGGCGCAGTC ACCCCGGATC
GAAGCGGTGA TCCCGGCGTT TCATGACTTT GCACAGAGCG CCGTTCTGGT TGCCCATAAC
GCCCCCTTTG ACATCGGGCT ATTGCGTCAA CAAGAGGTTC AGACCGGGTG CAGTTGGGAC
CATCCGGTTG TCGATACAGT GCTGTTGTCG GCCTTAGTGT TTGGGATTTC AGCGGATCAT
TCGCTGGATG CTTTGTGCAG CCGTCTTTCG ATCGAAATCC CCGCCCGGCA CCGCCATACG
GCCAAGGGAG ATGCGCGCGC TACGGCAGAA GCGCTCATAC GCCTTCTGCC GCTCTTGCAG
GGCAAGGGGA TCGAGACCTT TGGTCAACTC CTAAAGGAAA CCTCCAAATT TGGCCGACTG
CTGCGGGACA TCAACTCTGA CCACGTCCGA AACGCGCCCG AGATGAGCGA CAGCCCCTAG
 
Protein sequence
MLRSLSLRLR IFLFFCLLAV GAIALAAVAL GFVWTRSDSE WTASELTTVI LLFGFLNTGL 
VLGIWLLFDE NVARPIEGLS TSLRLRAHSG IEDSIKAQAA QYLGDLAPAA RALSDALAAS
GDQPPEQTQR LLQERERLTA LLSEIPIATI LVNAAGEIAL YDAQAGAILS RIAAPRLGAP
LSNYFDLTPA IGACARADRR LVAGHVTVPD CNHAETFNLQ IKSLGAEGDV IFLEADTRAE
RDTLSPPLVF DFDLLDQKLE GTITARRLSE LNFVVFDTET TGLSVTKDAI VELAAARVLN
GRILDGEVFE TYVDPGRPIP AASTKIHGVR DADVAQSPRI EAVIPAFHDF AQSAVLVAHN
APFDIGLLRQ QEVQTGCSWD HPVVDTVLLS ALVFGISADH SLDALCSRLS IEIPARHRHT
AKGDARATAE ALIRLLPLLQ GKGIETFGQL LKETSKFGRL LRDINSDHVR NAPEMSDSP