Gene B21_00229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00229 
SymboldinB 
ID8115445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp255458 
End bp256513 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content54% 
IMG OID644846519 
Producthypothetical protein 
Protein accessionYP_002998092 
Protein GI251783788 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAAAA TCATTCATGT GGATATGGAC TGCTTTTTCG CAGCGGTGGA GATGCGCGAC 
AATCCCGCCC TGCGCGATAT CCCTATTGCT ATTGGCGGCA GCCGCGAACG TCGGGGGGTG
ATCAGTACCG CCAATTATCC CGCGCGTAAA TTTGGCGTAC GTAGCGCTAT GCCGACAGGG
ATGGCGCTCA AATTATGCCC GCATCTCACC TTGCTTCCGG GGCGCTTTGA CGCCTACAAA
GAAGCCTCAA ATCATATCCG CGAAATCTTC TCGCGCTACA CCTCGCGTAT TGAACCGTTG
TCACTGGATG AGGCTTATCT CGACGTCACC GATAGCGTCC ATTGCCACGG TTCTGCGACC
CTCATCGCCC AGGAAATCCG CCAGACGATT TTCAACGAGC TGCAACTGAC GGCGTCTGCG
GGCGTGGCAC CCGTAAAGTT TCTCGCCAAA ATCGCCTCCG ACATGAATAA ACCCAACGGC
CAGTTTGTGA TTACGCCGGC AGAAGTTCCG GCATTTTTAC AAACCTTACC ACTGGCAAAA
ATCCCCGGCG TCGGCAAAGT CTCGGCGGCA AAACTGGAAG CGATGGGGCT ACGAACCTGC
GGTGATGTAC AAAAGTGTGA TCTGGTGATG CTGCTTAAAC GCTTTGGCAA ATTTGGCCGC
ATTTTGTGGG AGCGTAGTCA GGGGATTGAC GAGCGCGACG TTAACAGCGA ACGGTTGCGA
AAATCCGTCG GCGTGGAACG CACGATGGCG GAAGATATCC ACCACTGGTC TGAATGTGAA
GCGATTATCG AGCGGCTGTA TCCGGAACTT GAACGCCGTC TGGCAAAGGT GAAACCTGAT
TTACTGATTG CTCGCCAGGG GGTGAAATTA AAGTTTGATG ATTTTCAGCA AACCACTCAG
GAGCACGTCT GGCCGCGGCT GAATAAAGCT GACTTAATCG CCACCGCGCG TAAAACCTGG
GATGAACGCC GCGGCGGGCG CGGTGTGCGA CTGGTGGGGC TGCATGTGAC GTTGCTTGAT
CCGCAAATGG AAAGACAACT GGTGCTGGGA TTATGA
 
Protein sequence
MRKIIHVDMD CFFAAVEMRD NPALRDIPIA IGGSRERRGV ISTANYPARK FGVRSAMPTG 
MALKLCPHLT LLPGRFDAYK EASNHIREIF SRYTSRIEPL SLDEAYLDVT DSVHCHGSAT
LIAQEIRQTI FNELQLTASA GVAPVKFLAK IASDMNKPNG QFVITPAEVP AFLQTLPLAK
IPGVGKVSAA KLEAMGLRTC GDVQKCDLVM LLKRFGKFGR ILWERSQGID ERDVNSERLR
KSVGVERTMA EDIHHWSECE AIIERLYPEL ERRLAKVKPD LLIARQGVKL KFDDFQQTTQ
EHVWPRLNKA DLIATARKTW DERRGGRGVR LVGLHVTLLD PQMERQLVLG L