Gene EcE24377A_2302 details


Gene Information

Locus tag: EcE24377A_2302
Symbol: sbcB
ID: 5588345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 2265419
End bp: 2266846
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 50%
IMG OID: 640925967
Product: exonuclease I
Protein accession: YP_001463362
Protein GI: 157155858
COG category: [L] Replication, recombination and repair
COG ID: [COG2925] Exonuclease I
TIGRFAM ID:
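
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2265419
end_bp = 2266846

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1428 bp) and Protein Length (475 aa) fields.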


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGC 
ACGCACCCCG CGTTAGATCG CCCTGCACAG TTCGCAGCCA TTCGCACCGA TAGCGAATTC
AATGTCATCG GCGAACCCGA AGTCTTTTAC TGCAAGCCCG CGGATGACTA TTTGCCCCAG
CCTGGAGCAG TATTAATTAC CGGTATTACC CCGCAGGAAG CACGGGCGAA AGGAGAAAAC
GAAGCCGCGT TTGCCGCCCG TATTCACTCG CTTTTTAACG TACCGAAGAC CTGTATTCTG
GGCTACAACA ATGTGCGTTT CGACGACGAA GTCACACGCA ACGTTTTTTA TCGTAATTTC
TACGATCCTT ACGCCTGGAG CTGGCAGCAT GATAACTCGC GCTGGGATTT GCTGGATGTT
ATGCGTGCCT GTTATGCCCT GCGCCCGGAA GGAATAAACT GGCCTGAAAA TGATGACGGT
CTACCGAGCT TTCGCCTTGA GCATTTAACC AAAGCGAATG GTATTGAACA TAGCAACGCC
CACGATGCGA TGGCTGATGT GTACGCCACT ATTGCGATGG CAAAGCTGGT AAAAACGCGT
CAGCCACGCC TGTTTGATTA TCTCTTTACC CATCGTAATA AACACAAACT GATAGCGTTG
ATTGATGTTC CGCAGATGAA ACCCCTGGTG CACGTTTCCG GAATGTTTGG AGCATGGCGC
GGCAATACCA GCTGGGTGGC ACCGCTGGCG TGGCATCCTG AAAATCGCAA TGCCGTAATT
ATGGTGGATT TGGCAGGAGA CATTTCGCCA TTACTGGAAC TGGATAGCGA CACATTGCGC
GAGCGTTTAT ATACCGCAAA AACCGATCTT GGCGATAACG CCGCCGTTCC GGTTAAGCTG
GTGCATATCA ATAAATGTCC GGTGCTGGCC CAGGCGAATA CGCTACGCCC GGAAGATGCC
GACCGACTGG GAATTAATCG TCAGCATTGC CTCGATAACC TGAAAATTCT GCGTGAAAAT
CCGCAAGTGC GCGAAAAAGT GGTGGCGATA TTCGCGGAAG CCGAACCGTT TACGCCTTCA
GATAACGTGG ATGCACAGCT TTATAACGGC TTTTTCAGTG ACGCCGATCG TGCAGCAATG
AAAATTGTGC TGGAAACCGA GCCGCGTAAT TTACCGGCAC TGGATATCAC TTTTGTCGAT
AAACGGATTG AAAAGCTGTT GTTCAATTAT CGGGCGCGCA ACTTCCCGGG GACGCTGGAT
TATGCCGAGC AGCAACGCTG GCTGGAGCAC CGCCGCCAGG TCTTCACGCC CGAGTTTTTA
CAAGGTTATG CAGATGAATT GCAGATGCTG GCACAACAAT ATGCCGATAA CAAAGAGAAA
GTGGCGCTGT TAAAAGCGCT TTGGCAGTAC GCGGAAGAGA TCGTCTAA
 
Protein sequence
MMNDGKQQST FLFHDYETFG THPALDRPAQ FAAIRTDSEF NVIGEPEVFY CKPADDYLPQ 
PGAVLITGIT PQEARAKGEN EAAFAARIHS LFNVPKTCIL GYNNVRFDDE VTRNVFYRNF
YDPYAWSWQH DNSRWDLLDV MRACYALRPE GINWPENDDG LPSFRLEHLT KANGIEHSNA
HDAMADVYAT IAMAKLVKTR QPRLFDYLFT HRNKHKLIAL IDVPQMKPLV HVSGMFGAWR
GNTSWVAPLA WHPENRNAVI MVDLAGDISP LLELDSDTLR ERLYTAKTDL GDNAAVPVKL
VHINKCPVLA QANTLRPEDA DRLGINRQHC LDNLKILREN PQVREKVVAI FAEAEPFTPS
DNVDAQLYNG FFSDADRAAM KIVLETEPRN LPALDITFVD KRIEKLLFNY RARNFPGTLD
YAEQQRWLEH RRQVFTPEFL QGYADELQML AQQYADNKEK VALLKALWQY AEEIV
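
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two are understood to differ only in which start codons are permitted). The helper `translate` and the constant names are hypothetical, written for this example, and the input is only the first 60-base line of the gene sequence.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed in this record.
first_line = "ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGC"
print(translate(first_line))  # MMNDGKQQSTFLFHDYETFG
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence would exercise the stop-codon handling as well.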