Gene B21_00426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00426 
SymboldnaX 
ID8115402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp461040 
End bp462971 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content58% 
IMG OID644846710 
Producthypothetical protein 
Protein accessionYP_002998283 
Protein GI251783979 
COG category[L] Replication, recombination and repair 
COG ID[COG2812] DNA polymerase III, gamma/tau subunits 
TIGRFAM ID[TIGR00678] DNA polymerase III, delta' subunit
[TIGR02397] DNA polymerase III, subunit gamma and tau 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00196297 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTATC AGGTCTTAGC CCGAAAATGG CGCCCACAAA CCTTTGCTGA CGTCGTCGGC 
CAGGAACATG TGCTGACCGC ACTGGCGAAC GGCTTGTCGT TAGGGCGTAT TCATCATGCT
TATCTTTTTT CCGGCACCCG TGGCGTCGGA AAAACCTCTA TCGCCCGACT GCTGGCGAAG
GGGCTAAACT GCGAAACCGG CATTACCGCG ACGCCGTGCG GCGTGTGCGA TAACTGTCGT
GAAATCGAGC AGGGGCGCTT TGTCGATCTG ATTGAAATCG ACGCCGCCTC GCGCACCAAA
GTTGAAGATA CCCGCGACCT GCTGGATAAC GTCCAGTACG CTCCGGCGCG TGGTCGTTTC
AAAGTTTATC TGATCGACGA AGTGCATATG CTGTCGCGCC ACAGCTTTAA CGCACTGTTA
AAAACCCTTG AAGAGCCGCC GGAGCACGTT AAGTTTCTGC TGGCGACGAC CGATCCACAG
AAATTGCCGG TGACGATTTT GTCACGCTGT CTGCAATTTC ATCTCAAGGC GCTGGATGTC
GAGCAAATTC GCCATCAGCT TGAGCACATC CTCAACGAAG AACATATCGC TCACGAGCCG
CGGGCGCTGC AATTGCTGGC ACGCGCCGCT GAAGGCAGCC TGCGAGATGC CTTAAGTCTG
ACCGACCAGG CGATTGCCAG CGGTGACGGC CAGGTTTCAA CCCAGGCGGT CAGTGCGATG
CTGGGTACGC TTGACGACGA TCAGGCGCTG TCGCTGGTTG AAGCGATGGT CGAGGCCAAC
GGCGAGCGCG TAATGGCGCT GATTAATGAA GCCGCTGCCC GTGGTATCGA GTGGGAAGCG
TTGCTGGTGG AAATGCTCGG CCTGTTGCAT CGTATTGCGA TGGTACAACT TTCGCCTGCT
GCACTTGGCA ACGACATGGC CGCCATCGAG CTGCGGATGC GTGAACTGGC GCGCACCATA
CCGCCGACGG ATATTCAGCT TTACTATCAG ACGCTGTTGA TTGGTCGCAA AGAATTACCG
TATGCGCCGG ACCGTCGCAT GGGCGTTGAG ATGACGCTGC TGCGCGCGCT GGCATTCCAT
CCGCGTATGC CGCTGCCTGA GCCAGAAGTG CCACGACAGT CCTTTGCACC CGTCGCGCCA
ACGGCAGTAA TGACGCCAAC CCAGGTGCCG CCGCAACCGC AATCAGCGCC GCAGCAGGCA
CCGACTGTAC CGCTCCCGGA AACCACCAGC CAGGTGCTGG CGGCGCGCCA GCAGTTGCAG
CGCGTGCAGG GAGCAACCAA AGCAAAAAAG AGTGAACCGG CAGCCGCTAC CCGCGCGCGG
CCGGTGAATA ACGCTGCGCT GGAAAGACTG GCTTCGGTCA CCGATCGCGT TCAGGCGCGT
CCGGTGCCAT CGGCGCTGGA AAAAGCGCCA GCCAAAAAAG AAGCGTATCG CTGGAAGGCG
ACCACTCCGG TGATGCAGCA AAAAGAAGTG GTCGCCACGC CGAAGGCGCT GAAAAAAGCG
CTGGAACATG AAAAAACGCC GGAACTGGCG GCGAAGCTAG CGGCAGAAGC CATTGAGCGC
GACCCGTGGG CGGCACAGGT GAGCCAACTT TCGCTACCAA AACTGGTCGA ACAGGTGGCG
TTAAATGCCT GGAAAGAGGA GAGCGACAAC GCAGTATGTC TGCATTTGCG CTCCTCTCAG
CGGCATTTGA ACAACCGCGG TGCACAGCAA AAACTGGCTG AAGCGTTGAG CATGTTAAAA
GGTTCAACGG TTGAACTGAC TATCGTTGAA GATGATAATC CCGCGGTGCG TACGCCGCTG
GAGTGGCGTC AGGCGATATA CGAAGAAAAA CTTGCGCAGG CGCGCGAGTC CATTATTGCG
GATAATAATA TTCAGACCCT GCGTCGGTTC TTCGATGCGG AGCTGGATGA AGAAAGTATC
CGCCCCATTT GA
 
Protein sequence
MSYQVLARKW RPQTFADVVG QEHVLTALAN GLSLGRIHHA YLFSGTRGVG KTSIARLLAK 
GLNCETGITA TPCGVCDNCR EIEQGRFVDL IEIDAASRTK VEDTRDLLDN VQYAPARGRF
KVYLIDEVHM LSRHSFNALL KTLEEPPEHV KFLLATTDPQ KLPVTILSRC LQFHLKALDV
EQIRHQLEHI LNEEHIAHEP RALQLLARAA EGSLRDALSL TDQAIASGDG QVSTQAVSAM
LGTLDDDQAL SLVEAMVEAN GERVMALINE AAARGIEWEA LLVEMLGLLH RIAMVQLSPA
ALGNDMAAIE LRMRELARTI PPTDIQLYYQ TLLIGRKELP YAPDRRMGVE MTLLRALAFH
PRMPLPEPEV PRQSFAPVAP TAVMTPTQVP PQPQSAPQQA PTVPLPETTS QVLAARQQLQ
RVQGATKAKK SEPAAATRAR PVNNAALERL ASVTDRVQAR PVPSALEKAP AKKEAYRWKA
TTPVMQQKEV VATPKALKKA LEHEKTPELA AKLAAEAIER DPWAAQVSQL SLPKLVEQVA
LNAWKEESDN AVCLHLRSSQ RHLNNRGAQQ KLAEALSMLK GSTVELTIVE DDNPAVRTPL
EWRQAIYEEK LAQARESIIA DNNIQTLRRF FDAELDEESI RPI