Gene EcolC_1629 details


Gene Information

Locus tag: EcolC_1629
Symbol: sbcB
ID: 6066380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1810748
End bp: 1812175
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 51%
IMG OID: 641601044
Product: exonuclease I
Protein accession: YP_001724614
Protein GI: 170019660
COG category: [L] Replication, recombination and repair
COG ID: [COG2925] Exonuclease I
TIGRFAM ID:
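
The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; neither assumption is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1810748
end_bp = 1812175

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1428 475
```

Under these assumptions the computed values agree with the reported Gene Length (1428 bp) and Protein Length (475 aa).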


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.186939
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGC 
ACGCACCCCG CGTTAGATCG CCCTGCACAG TTCGCAGCCA TTCGCACCGA TAGCGAATTC
AATGTCATCG GCGAACCCGA AGTCTTTTAC TGCAAGCCCG CTGATGACTA TTTACCCCAG
CCTGGAGCAG TATTAATTAC CGGTATTACC CCGCAGGAAG CACGGGCGAA AGGAGAAAAC
GAAGCCGCGT TTGCCGCCCG TATTCACTCG CTTTTTACCG TACCGAAGAC CTGTATTCTG
GGCTACAACA ATGTGCGTTT CGACGACGAA GTCACACGCA ACGTTTTTTA TCGTAATTTC
TACGATCCTT ACGCCTGGAG CTGGCAGCAT GATAACTCGC GCTGGGATTT ACTGGATGTT
ATGCGTGCCT GTTATGCCCT GCGCCCGGAA GGAATAAACT GGCCTGAAAA TGATGACGGT
CTACCGAGCT TTCGCCTTGA GCATTTAACC AAAGCGAATG GTATTGAACA TAGCAACGCC
CACGATGCGA TGGCTGATGT GTACGCCACT ATTGCGATGG CAAAGCTGGT AAAAACGCGT
CAGCCACGCC TGTTTGATTA TCTCTTTACC CATCGTAATA AACACAAACT GATGGCGTTG
ATTGATGTTC CGCAGATGAA ACCCCTGGTG CACGTTTCCG GAATGTTTGG GGCATGGCGC
GGCAACACCA GCTGGGTGGC ACCGCTGGCG TGGCATCCTG AAAATCGCAA TGCCGTAATT
ATGGTGGATT TGGCAGGAGA CATTTCGCCA TTACTGGAAC TGGATAGCGA CACATTGCGC
GAGCGTTTAT ATACCGCAAA AACCGATCTT GGCGATAACG CCGCCGTTCC GGTTAAGCTG
GTGCACATCA ATAAATGTCC GGTGCTGGCC CAGGCGAATA CGCTACGCCC GGAAGATGCC
GACCGACTGG GAATTAATCG TCAGCATTGC CTCGATAACC TGAAAATTCT GCGTGAAAAT
CCGCAAGTGC GCGAAAAAGT GGTGGCGATA TTCGCGGAAG CCGAACCGTT TACGCCTTCA
GATAACGTGG ATGCACAGCT TTATAACGGC TTTTTCAGTG ACGCCGATCG TGCAGCAATG
AAAATTGTGC TGGAAACCGA GCCGCGTAAT TTACCGGCAC TGGATATCAC TTTTGTCGAT
AAACGGATTG AAAAGCTGTT GTTCAATTAT CGGGCGCGCA ACTTCCCGGG GACGCTGGAT
TATGCCGAGC AGCAACGCTG GCTGGAGCAC CGCCGCCAGG TATTCACACC CGAGTTTTTA
CAAGGCTATG CAGAAGAGAT TCAGATGCTG GCGCAGCAGT ATGCCGACGA TAAAGAAAAA
GTGGCGCTGT TAAAAGCGCT TTGGCAGTAC GCGGAAGAGA TTGTCTAG
 
Protein sequence
MMNDGKQQST FLFHDYETFG THPALDRPAQ FAAIRTDSEF NVIGEPEVFY CKPADDYLPQ 
PGAVLITGIT PQEARAKGEN EAAFAARIHS LFTVPKTCIL GYNNVRFDDE VTRNVFYRNF
YDPYAWSWQH DNSRWDLLDV MRACYALRPE GINWPENDDG LPSFRLEHLT KANGIEHSNA
HDAMADVYAT IAMAKLVKTR QPRLFDYLFT HRNKHKLMAL IDVPQMKPLV HVSGMFGAWR
GNTSWVAPLA WHPENRNAVI MVDLAGDISP LLELDSDTLR ERLYTAKTDL GDNAAVPVKL
VHINKCPVLA QANTLRPEDA DRLGINRQHC LDNLKILREN PQVREKVVAI FAEAEPFTPS
DNVDAQLYNG FFSDADRAAM KIVLETEPRN LPALDITFVD KRIEKLLFNY RARNFPGTLD
YAEQQRWLEH RRQVFTPEFL QGYAEEIQML AQQYADDKEK VALLKALWQY AEEIV
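
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the sequence is given in the coding direction, starting at the first base of the start codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so a standard codon table is used here. The helper names `clean` and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon table in TCAG order; the amino acid assignments are
# the same ones used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last display lines of the gene sequence, copied from above.
first_line = "ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGC"
last_line = "GTGGCGCTGT TAAAAGCGCT TTGGCAGTAC GCGGAAGAGA TTGTCTAG"

print(translate(first_line))  # MMNDGKQQSTFLFHDYETFG
print(translate(last_line))   # VALLKALWQYAEEIV
```

Both display lines start on a codon boundary because each full line holds 60 bases. The two outputs match the first 20 and the last 15 residues of the protein sequence.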