Gene EcSMS35_0513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0513 
SymboldnaX 
ID6143044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp520405 
End bp522336 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content57% 
IMG OID641615407 
ProductDNA polymerase III subunits gamma and tau 
Protein accessionYP_001742614 
Protein GI170681602 
COG category[L] Replication, recombination and repair 
COG ID[COG2812] DNA polymerase III, gamma/tau subunits 
TIGRFAM ID[TIGR00678] DNA polymerase III, delta' subunit
[TIGR02397] DNA polymerase III, subunit gamma and tau 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0325277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTATC AGGTCTTAGC CCGAAAATGG CGCCCACAAA CCTTTGCTGA CGTCGTCGGC 
CAGGAACATG TGCTGACCGC ACTGGCGAAC GGCTTGTCGT TAGGGCGTAT TCATCATGCT
TATCTTTTTT CCGGTACCCG TGGCGTCGGA AAAACCTCTA TCGCCCGACT GCTGGCGAAG
GGGCTAAACT GCGAAACCGG CATTACCGCG ACGCCGTGCG GCGTGTGCGA TAACTGTCGT
GAAATCGAGC AGGGGCGCTT TGTCGATCTG ATTGAAATCG ACGCCGCCTC GCGTACTAAA
GTTGAAGACA CTCGCGACCT GCTGGATAAC GTCCAGTACG CTCCGGCGCG TGGTCGTTTC
AAAGTTTATC TCATCGACGA AGTGCATATG CTGTCGCGCC ACAGCTTTAA CGCACTGTTA
AAAACCCTTG AAGAGCCGCC GGAGCACGTT AAGTTTCTGC TGGCGACGAC CGATCCGCAG
AAATTGCCGG TGACGATTTT GTCGCGCTGT CTGCAATTCC ATCTCAAGGC GCTGGATGTC
GAGCAAATTC GCCATCAGCT TGAGCACATC CTCAACGAAG AACATATCGC TCACGAGCCG
CGGGCGTTGC AATTACTGGC GCGCGCCGCT GAAGGCAGCT TGCGAGATGC CTTAAGTCTG
ACCGACCAGG CGATTGCCAG CGGTGACGGC CAGGTTTCAA CCCAGGCCGT CAGTGCGATG
CTGGGTACGC TTGACGACGA TCAGGCGTTG TCGCTGGTTG AAGCGATGGT CGAGGCCAAC
GGCGAGCGCG TAATGGCGCT AATTAATGAA GCTGCTGCCC GTGGTATCGA GTGGGAAGCG
TTGCTGGTGG AAATGCTCGG TTTGTTGCAT CGTATTGCGA TGGTACAACT TTCGCCTGCT
GCACTTGGCA ACGACATGGC CGCCATCGAG CTGCGGATGC GTGAACTGGC GCGTACCATA
CCGCCGACGG ATATTCAGCT TTACTATCAA ACGCTGTTGA TTGGTCGCAA AGAATTACCG
TATGCGCCGG ACCGCCGCAT GGGCGTTGAG ATGACGCTGC TGCGCGCGCT GGCGTTCCAT
CCGCGTATGC CTCTGCCTGA GCCAGAAGTG CCACGCCAGT CCTTTGCGCC CGTTGCGCCA
ACGGCAGTAA TGACGCCAAC CCAGGTGCCG CCGCAACCGC AATCAGCGCC GCAGCAGGCA
CCGACTGTAC CGCTCCCGGA AACCACCAGC CAGGTGCTGG CGGCGCGCCA GCAGTTGCAG
CGCGTGCAGG GAGCAACCAA AGCAAAAAAG AGTGAACCGG CAGCCGCTAC CCGCGCGCGG
CCGGTGAATA ACGCTGCGCT GGAAAGACTG GCTTCGGTCA CCGATCGCGT TCAGGCGCGC
CCGGTGCCAT CGGCGCTGGA AAAAGCGCCA GCCAAAAAAG AAGCGTATCG CTGGAAGGCG
ACCACTCCAG TGATGCAGCA AAAAGAAGTG GTCGCCACGC CGAAGGCGCT GAAAAAAGCG
CTGGAACATG AAAAAACGCC GGAACTGGCG GCGAAGCTGG CGGCAGAAGC CATTGAGCGC
GACCCGTGGG CGGCTCAGGT GAGTCAACTT TCGCTACCAA AACTGGTCGA ACAGGTGGCG
TTAAATGCCT GGAAAGAGGA GAGCGACAAC GCAGTATGTC TGCATTTGCG CTCCTCTCAG
CGGCATTTGA ACAACCGCGG CGCACAGCAA AAACTGGCTG AAGCGTTGAG CACGTTAAAA
GGTTCAACGG TTGAACTGAC TATCGTTGAA GATGATAATC CCGCGGTGCG TACGCCGCTG
GAATGGCGTC AGGCGATATA CGAAGAAAAA CTTGCGCAGG CGCGCGAGTC CATTATTGCG
GATAATAATA TTCAGACCCT GCGTCGATTC TTCGATGCGG AGCTGGATGA AGAAAGTATC
CGCCCCATTT GA
 
Protein sequence
MSYQVLARKW RPQTFADVVG QEHVLTALAN GLSLGRIHHA YLFSGTRGVG KTSIARLLAK 
GLNCETGITA TPCGVCDNCR EIEQGRFVDL IEIDAASRTK VEDTRDLLDN VQYAPARGRF
KVYLIDEVHM LSRHSFNALL KTLEEPPEHV KFLLATTDPQ KLPVTILSRC LQFHLKALDV
EQIRHQLEHI LNEEHIAHEP RALQLLARAA EGSLRDALSL TDQAIASGDG QVSTQAVSAM
LGTLDDDQAL SLVEAMVEAN GERVMALINE AAARGIEWEA LLVEMLGLLH RIAMVQLSPA
ALGNDMAAIE LRMRELARTI PPTDIQLYYQ TLLIGRKELP YAPDRRMGVE MTLLRALAFH
PRMPLPEPEV PRQSFAPVAP TAVMTPTQVP PQPQSAPQQA PTVPLPETTS QVLAARQQLQ
RVQGATKAKK SEPAAATRAR PVNNAALERL ASVTDRVQAR PVPSALEKAP AKKEAYRWKA
TTPVMQQKEV VATPKALKKA LEHEKTPELA AKLAAEAIER DPWAAQVSQL SLPKLVEQVA
LNAWKEESDN AVCLHLRSSQ RHLNNRGAQQ KLAEALSTLK GSTVELTIVE DDNPAVRTPL
EWRQAIYEEK LAQARESIIA DNNIQTLRRF FDAELDEESI RPI