Gene Tpau_3874 details


Gene Information

Locus tag: Tpau_3874
Symbol:
ID: 9158055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 3996380
End bp: 3997600
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: DNA polymerase III, delta prime subunit
Protein accession: YP_003648788
Protein GI: 296141545
COG category:
COG ID:
TIGRFAM ID:
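
The Start bp, End bp, Gene Length and Protein Length values listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a final stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count for an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the Gene Information fields of this record.
START_BP = 3996380
END_BP = 3997600

length = gene_length(START_BP, END_BP)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Both results match the listed Gene Length (1221 bp) and Protein Length (406 aa).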


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTGGGG TGTTCAGTCG GCTGGTCGGG CAGGACGCGG TGGTAGCGGA GCTGCGTGCC 
GCCGCGGAGG CCGCCCGTGA CGTGGTCGCG AGCGGGGGCA CCGTGGCCGA TCTGGCCGGA
TCCGCGATGA CCCACGCCTG GCTGTTCACC GGTCCGCCCG GCTCCGGCAG GTCGGTGGCG
GCCCGCGCAC TGGCTGCCGC GCTGCAATGC GAAGACCCGG CCGAGCCGGG CTGCGGTCAC
TGTCGCGCGT GCACCACGGT GCTCGCCGGA ACCCACGCTG ACGTGCGCGC AATCGCCCCC
GACGGGCTGT CGATCGCGGT CAAGCAGATG CGTGAAGTGG TGGCCGACGC GTCCCGCCGC
CCGTCGGTCG GACATTGGCA GATCGTGCTC ATCGAGGATG CCGACCGGCT CACCGAACAG
GCCGGTAACG CATTGCTGAA GATGGTCGAG GAGCCGCCTG CGCAGACCAT CGTCCTGTTG
TGCGCGCCCA CCGTCGACCC GGAGGACATC TCCGTCACGC TGAAGTCCCG CTGCCGGCAT
GTGCCGCTCG TCACGCCGTC GGCCCCGGCG ATCGCCGCCG TGCTCGAACG GGACGGTATC
GATGCCGAGC GGGCCGCGTG GGCGGCCGGG GTGTGCGGAG GCCATGTCGG GCGGGCGAAG
CGGCTCGCGA CCGATCCCGA GTCGCAGAAG CAGCGTCGGC AGGCCCTCAG CGTGGCGCGG
GCCGCCACCT CGGAGGGTGT GTACGGCGTT GTCGAGCAGT TGCTGCGCGA CGCGGAATCG
GCCGCCAAGG AGATCAACGC CGATCTCAAC GAGCGCGAGA CCGAAGAACT GAAGACCGCG
CTGGGCGCCG GCGGTACCGG CCGCGGAACG GCAGGTGTCA TGCGTGGTTC CGCCGGTCAG
CTCAAAGACC TGGAGAAGCG GCAGAAGGCT CGCAGTACCC GGTCGGTGCG CGACGCCCTC
GACCGCGCGC TGATCGACCT GGCCGCTCTG TTCCGGGACG CGCTGGTGCA GGGATCGGGC
GCGCAGGTCA CGCTGATGCA TCCCGACGAG GCCGAACAGA CCAGTCGGCT CGCCGGCTAC
GCGCGGCCCG AGGGCCTGCT GCGCTGCGTG GAATCGGTGC TCGACTGTCG CGAGGCGATC
GACCTCAACG TCAAGCCCGT GGTGGCACTC GATGCGATGG CGGCCGGCGT CTCGTCGGCG
CTGCGCGACT ACCGCCGTTA G
 
Protein sequence
MSGVFSRLVG QDAVVAELRA AAEAARDVVA SGGTVADLAG SAMTHAWLFT GPPGSGRSVA 
ARALAAALQC EDPAEPGCGH CRACTTVLAG THADVRAIAP DGLSIAVKQM REVVADASRR
PSVGHWQIVL IEDADRLTEQ AGNALLKMVE EPPAQTIVLL CAPTVDPEDI SVTLKSRCRH
VPLVTPSAPA IAAVLERDGI DAERAAWAAG VCGGHVGRAK RLATDPESQK QRRQALSVAR
AATSEGVYGV VEQLLRDAES AAKEINADLN ERETEELKTA LGAGGTGRGT AGVMRGSAGQ
LKDLEKRQKA RSTRSVRDAL DRALIDLAAL FRDALVQGSG AQVTLMHPDE AEQTSRLAGY
ARPEGLLRCV ESVLDCREAI DLNVKPVVAL DAMAAGVSSA LRDYRR
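
The sequences above are displayed in space-separated blocks of ten letters, so the spacing has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage. It uses only the first line of the gene sequence as sample input, and the helper names are hypothetical. The gene starts with GTG while the protein starts with M, which is consistent with GTG acting as a start codon under translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as displayed in this record (sample only).
first_line = "GTGAGTGGGG TGTTCAGTCG GCTGGTCGGG CAGGACGCGG TGGTAGCGGA GCTGCGTGCC "

fragment = clean_sequence(first_line)
print(len(fragment))   # 60 bases: six blocks of ten
print(fragment[:3])    # GTG, the first codon of the gene
print(round(gc_percent(fragment), 1))
```

Passing the full 1221 bp sequence through the same two functions gives a value that can be compared with the GC content field of the record.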