Gene EcE24377A_0425 details


Gene Information

Locus tag: EcE24377A_0425
Symbol: sbcD
ID: 5586171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 448625
End bp: 449827
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 53%
IMG OID: 640924149
Product: exonuclease subunit SbcD
Protein accession: YP_001461576
Protein GI: 157155893
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID: [TIGR00619] exonuclease SbcD
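
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon while the protein length does not. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information record.
START_BP = 448625
END_BP = 449827
GENE_LENGTH_BP = 1203
PROTEIN_LENGTH_AA = 400


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons in a CDS, excluding the final stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the record if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 449827 - 448625 + 1 gives 1203 bp, and 1203 / 3 - 1 gives 400 aa, matching the listed values.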


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCGCATCC TTCACACCTC AGACTGGCAT CTCGGCCAGA ACTTCTACAG TAAAAGCCGC 
GAAGCTGAAC ATCAGGCTTT TCTTGACTGG CTGCTGGAAA CAGCACAAAC CCATCAGGTG
GATGCGATTA TTGTTGCTGG TGATGTTTTC GATACCGGCT CGCCGCCCAG TTACGCCCGC
ACGTTATACA ACCGTTTTGT TGTCAATTTA CAGCAAACTG GCTGTCATCT GGTGGTACTG
GCAGGAAACC ATGACTCGGT CGCCACGCTG AATGAATCGC GCGATATCAT GGCGTTCCTC
AATACGACCG TGGTCGCCAG CGCCGGACAT GCGCCGCAAA TCTTGCCTCG TCGCGACGGG
ACGCCAGGCG CAGTGCTGTG CCCCATTCCG TTTTTACGTC CGCGTGACAT TATTACCAGC
CAGGCGGGGC TTAACGGTAT TGAAAAACAG CAGCATTTAC TGGCAGCGAT TACCGATTAT
TACCAACAAC AATATGCCGA TGCCTGCAAA CTGCGCGGCG ATCAGCCTCT GCCCATCATC
GCCACGGGAC ACTTAACGAC CGTGGGTGCC AGTAAAAGTG ACGCCGTGCG TGGCATTTAT
ATTGGCACGC TGGACGCGTT TCCAGCACAA AACTTTCCAC CAGCCGACTA CATCGCGCTC
GGGCATATTC ACCGCGCACA GATTATTGGC GGCATGGAAC ATGTTCGCTA TTGCGGCTCT
CCCATTCCAC TGAGTTTTGA TGAATGCGGA AAGAGTAAAT ATGTCCATCT GGTGACATTT
TCAAACGGCA AATTAGAGAG CGTGGAAAAC CTGAACGTAC CGGTAACGCA ACCCATGGCG
GTGCTGAAAG GCGATCTGGC GTCGATTACC GCACAGCTGG AACAGTGGCG CGATGTATCA
CAGGAGCCGC CTGTCTGGCT GGATATCGAA ATCATTACTG ATGAGTATCT GCATGATATT
CAGCGCAAAA TCCAGGCATT AACCGAATCA TTGCCTGTTG AAGTATTGCT GGTACGCCGG
AGTCGTGAAC AGCGCGAGCG TGTGTTAGCC AGCCAACAGC GTGAAACCCT CAGCGAACTC
AGCGTCGAAG AGGTGTTCAA TCGCCGTCTG GCACTGGAAG AACTGGATGA ATCGCAGCAG
CAACGTCTGC AGCATCTTTT CACCACGACG TTGCATACCC TCGCCGGAGA ACACGAAGCA
TGA
 
Protein sequence
MRILHTSDWH LGQNFYSKSR EAEHQAFLDW LLETAQTHQV DAIIVAGDVF DTGSPPSYAR 
TLYNRFVVNL QQTGCHLVVL AGNHDSVATL NESRDIMAFL NTTVVASAGH APQILPRRDG
TPGAVLCPIP FLRPRDIITS QAGLNGIEKQ QHLLAAITDY YQQQYADACK LRGDQPLPII
ATGHLTTVGA SKSDAVRGIY IGTLDAFPAQ NFPPADYIAL GHIHRAQIIG GMEHVRYCGS
PIPLSFDECG KSKYVHLVTF SNGKLESVEN LNVPVTQPMA VLKGDLASIT AQLEQWRDVS
QEPPVWLDIE IITDEYLHDI QRKIQALTES LPVEVLLVRR SREQRERVLA SQQRETLSEL
SVEEVFNRRL ALEELDESQQ QRLQHLFTTT LHTLAGEHEA
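
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. It relies on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code (the tables differ in permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 shares). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First line of the gene sequence from the record.
    first_row = ("ATGCGCATCC TTCACACCTC AGACTGGCAT "
                 "CTCGGCCAGA ACTTCTACAG TAAAAGCCGC")
    print(translate(first_row))  # MRILHTSDWHLGQNFYSKSR
```

The translated first row corresponds to the first two blocks of the protein sequence listed above (MRILHTSDWH LGQNFYSKSR). Passing the full gene sequence to `translate` would end with `*` for the terminal TGA stop codon.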