Gene EcSMS35_0427 details

Gene Information

Locus tag: EcSMS35_0427
Symbol: sbcD
ID: 6146676
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 438025
End bp: 439227
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 53%
IMG OID: 641615323
Product: exonuclease subunit SbcD
Protein accession: YP_001742530
Protein GI: 170683006
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID: [TIGR00619] exonuclease SbcD
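
The start, end, gene length, and protein length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, and if the gene length includes one terminal stop codon. The sketch below checks that arithmetic under those assumptions; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(438025, 439227)
print(length)                     # 1203
print(protein_length_aa(length))  # 400
```

Both results match the Gene Length and Protein Length values listed in the record.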


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.429993
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATCC TTCACACCTC AGACTGGCAT CTCGGCCAGA ACTTCTACAG TAAAAGCCGC 
GAAGCTGAAC ATCAGGCTTT TCTTGACTGG CTGCTGGAGA CAGCACAAAC CCATCAGGTG
GATGCGATTA TTGTTGCTGG TGATGTTTTC GATACCGGCT CACCGCCCAG TTACGCCCGC
ACGTTATACA ACCGTTTTGT TGTCAATTTA CAGCAAACTG GCTGTCATCT GGTGGTACTG
GCAGGAAACC ATGACTCGGT CGCCACGCTG AATGAATCGC GCGATATCAT GGCGTTCCTC
AATACTACCG TGGTCGCCAG CGCCGGACAT GCGCCGCAAA TCTTGCCTCG TCGCGACGGG
ACGCCAGGCG CAGTGCTGTG CCCCATACCT TTTTTACGCC CGCGTGACAT TATTACCAGC
CAGGCGGGGC TTAACGGTAT TGAAAAACAG CAGCATTTGC TGGCGGCTAT TACTGATTAT
TATCAACAAC ATTATGCCGA TGCCTGCAAA CTGCGCGGCG ATCAGCCTCT GCCCATCATC
GCCACGGGAC ATTTAACGAC CGTGGGTGCC AGTAAAAGTG ACGCCGTGCG TGACATTTAT
ATTGGCACGC TGGACGCGTT TCCGGCACAA AACTTTCCAC CAGCCGACTA CATCGCGCTC
GGGCATATTC ACCGCGCGCA GATTATTGGC GGCATGGAAC ATGTTCGCTA TTGCGGTTCC
CCCATACCAC TGAGTTTTGA TGAATGCGGT AAGAGTAAAT ATGTCCATCT GGTGACATTT
TCAAACGGCA AATTAGAGAG CGTGGAAAAC CTGAACGTAC CGGTAACGCA ACCCATGGCG
GTGCTGAAAG GCGATCTGGC GTCGATTACC GAACAGCTGG AACAGTGGCG CGATGTATCA
CAGGAGCCAC CTGTCTGGCT GGATATCGAA ATCACTACTG ATGAGTATCT GCATGATATT
CAGCGCAAAA TCCAGGCATT AACCGAATCA TTGCCCGTTG AAGTATTGCT GGTACGCCGG
AGTCGTGAAC AGCGCGAGCG TGTGTTAGCC AGCCAACAGC GTGAAACCCT CAGCGAACTC
AGCGTCGAAG AGGTGTTCAA TCGCCGTCTG GCACTGGAAG AACTGGATGA ATCACAGCAG
CAACGTCTGC AGCATCTTTT CGCCACGACA TTGCATAGCC TCGCCGGAGA ACACGAAGCA
TGA
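
The GC content field can in principle be recomputed from the gene sequence. A minimal sketch follows, assuming GC content is defined as the percentage of G and C bases over all bases, and that the spaces separating the 10-base blocks should be ignored; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, used here only as a small example.
print(gc_percent("ATGCGCATCC"))  # 60.0
```

Passing the full gene sequence instead of the first block would give the value to compare against the rounded figure in the record.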
 
Protein sequence
MRILHTSDWH LGQNFYSKSR EAEHQAFLDW LLETAQTHQV DAIIVAGDVF DTGSPPSYAR 
TLYNRFVVNL QQTGCHLVVL AGNHDSVATL NESRDIMAFL NTTVVASAGH APQILPRRDG
TPGAVLCPIP FLRPRDIITS QAGLNGIEKQ QHLLAAITDY YQQHYADACK LRGDQPLPII
ATGHLTTVGA SKSDAVRDIY IGTLDAFPAQ NFPPADYIAL GHIHRAQIIG GMEHVRYCGS
PIPLSFDECG KSKYVHLVTF SNGKLESVEN LNVPVTQPMA VLKGDLASIT EQLEQWRDVS
QEPPVWLDIE ITTDEYLHDI QRKIQALTES LPVEVLLVRR SREQRERVLA SQQRETLSEL
SVEEVFNRRL ALEELDESQQ QRLQHLFATT LHSLAGEHEA