Gene B21_00350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00350 
SymbolsbcD 
ID8116713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp381053 
End bp382255 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content53% 
IMG OID644846634 
Producthypothetical protein 
Protein accessionYP_002998207 
Protein GI251783903 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID[TIGR00619] exonuclease SbcD 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATCC TTCACACCTC AGACTGGCAT CTCGGCCAGA ACTTCTACAG TAAAAGCCGC 
GAAGCTGAAC ATCAGGCTTT TCTTGACTGG CTGCTGGAAA CAGCACAAAC CCATCAGGTG
GATGCGATTA TTGTTGCTGG TGATGTTTTC GATACCGGCT CGCCGCCCAG TTACGCCCGC
ACGTTATACA ACCGTTTTGT TGTCAATTTA CAGCAAACTG GCTGTCATCT GGTGGTACTG
GCAGGAAACC ATGACTCGGT CGCCACGCTG AATGAATCGC GCGATATCAT GGCGTTCCTC
AATACGACCG TGGTCGCCAG CGCCGGACAT GCGCCGCAAA TCTTGCCTCG TCGCGACGGG
ACGCCAGGCG CAGTGCTGTG CCCCATTCCG TTTTTACGTC CGCGTGACAT TATTACCAGC
CAGGCGGGGC TTAACGGTAT TGAAAAACAG CAGCATTTAC TGGCAGCGAT TACCGATTAT
TACCAACAAC ACTATGCCGA TGCCTGCAAA CTGCGCGGCG ATCAGCCTCT GCCCATCATC
GCCACGGGAC ACTTAACGAC CGTGGGTGCC AGTAAAAGTG ACGCCGTGCG TGACATTTAT
ATTGGCACGC TGGACGCGTT TCCAGCACAA AACTTTCCAC CAGCCGACTA CATCGCGCTC
GGGCATATTC ACCGCGCACA GATTATTGGC GGCATGGAAC ATGTTCGCTA TTGCGGCTCT
CCCATTCCAC TGAGTTTTGA TGAATGCGGA AAGAGTAAAT ATGTCCATCT GGTGACATTT
TCAAACGGCA AATTAGAGAG CGTGGAAAAC CTGAACGTAC CGGTAACGCA ACCCATGGCG
GTGCTGAAAG GCGATCTGGC GTCGATTACC GCACAGCTGG AACAGTGGCG CGATGTATCA
CAGGAGCCGC CTGTCTGGCT GGATATCGAA ATCACTACTG ATGAGTATCT GCATGATATT
CAGCGCAAAA TCCAGGCATT AACCGAATCA TTGCCTGTTG AAGTATTGCT GGTACGCCGG
AGTCGTGAAC AGCGCGAGCG TGTGTTAGCC AGCCAACAGC GTGAAACCCT CAGCGAACTC
AGCGTCGAAG AGGTGTTCAA TCGCCGTCTG GCACTGGAAG AACTGGATGA ATCGCAGCAG
CAACGTCTGC AGCATCTTTT CACCACGACG TTGCATACCC TCGCCGGAGA ACACGAAGCA
TGA
 
Protein sequence
MRILHTSDWH LGQNFYSKSR EAEHQAFLDW LLETAQTHQV DAIIVAGDVF DTGSPPSYAR 
TLYNRFVVNL QQTGCHLVVL AGNHDSVATL NESRDIMAFL NTTVVASAGH APQILPRRDG
TPGAVLCPIP FLRPRDIITS QAGLNGIEKQ QHLLAAITDY YQQHYADACK LRGDQPLPII
ATGHLTTVGA SKSDAVRDIY IGTLDAFPAQ NFPPADYIAL GHIHRAQIIG GMEHVRYCGS
PIPLSFDECG KSKYVHLVTF SNGKLESVEN LNVPVTQPMA VLKGDLASIT AQLEQWRDVS
QEPPVWLDIE ITTDEYLHDI QRKIQALTES LPVEVLLVRR SREQRERVLA SQQRETLSEL
SVEEVFNRRL ALEELDESQQ QRLQHLFTTT LHTLAGEHEA