Gene Information |
Locus tag | EcHS_A2149 |
Symbol | sbcB |
ID | 5594313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2128610 |
End bp | 2130037 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640921282 |
Product | exonuclease I |
Protein accession | YP_001458821 |
Protein GI | 157161503 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2925] Exonuclease I |
TIGRFAM ID | |
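
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag EcHS_A2149 (sbcB).
print(gene_length(2128610, 2130037))  # 1428
print(protein_length(1428))           # 475
```

Under these assumptions the results agree with the Gene Length (1428 bp) and Protein Length (475 aa) fields.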
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGC ACGCATCCCG CGTTAGATCG CCCTGCACAG TTCGCAGCCA TTCGCACCGA TAGCGAATTC AATGTCATCG GCGAACCCGA AGTCTTTTAC TGCAAGCCCG CGGATGACTA TTTACCCCAG CCTGGAGCAG TATTAATTAC CGGTATTACC CCGCAGGAAG CTCGGGCGAA AGGAGAAAAC GAAGCCGCAT TTGCCGCCCG TATTCACTCG CTTTTTACCG TACCGAAGAC CTGTATTCTG GGCTACAACA ATGTGCGTTT CGACGATGAA GTCACACGCA ACGTTTTTTA TCGTAATTTC TACGATCCTT ACGCCTGGAG CTGGCAGCAT GATAACTCGC GCTGGGATTT ACTGGATGTT ATGCGTGCCT GTTATGCCCT GCGCCCGGAA GGAATAAACT GGCCTGAAAA TGATGACGGT CTACCGAGCT TTCGCCTTGA GCATTTAACC AAAGCGAATG GTATTGAACA TAGCAACGCC CACGATGCGA TGGCTGATGT GTACGCCACT ATTGCGATGG CAAAGCTGGT AAAAACGCGT CAGCCACGCC TGTTTGATTA TCTCTTTACC CATCGCAATA AACACAAACT GATGGCGTTG ATTGATGTTC CGCAGATGAA ACCCCTGGTT CACGTTTCCG GAATGTTTGG GGCATGGCGC GGCAATACCA GCTGGGTAGC ACCGCTGGCG TGGCATCCTG AAAATCGCAA TGCCGTAATT ATGGTGGATT TGGCAGGAGA CATTTCGCCA TTACTGGAAC TGGATAGCGA CACATTGCGC GAGCGTCTAT ATACCGCAAA AGCCGATCTT GGCGATAACG CCGCCGTTCC GGTTAAACTG GTGCATATCA ATAAATGTCC GGTGCTGGCC CAGGCGAATA CGCTACGCCC GGAAGATGCC GACCGACTGG GAATTAATCG TCAGCATTGC CTCGATAACC TGAAAATTCT GCGTGAAAAT CCGCAAGTGC GTGAAAAAGT GGTGGCGATA TTCGCGGAAG CCGAACCGTT TACGCCTTCA GATAACGTGG ATGCACAGCT TTATAATGGC TTTTTCAGTG ACGCCGATCG TGCAGCAATG AAAATTGTGC TTGAAACCGA GCCGCGTAAT TTACCGGCAC TGGATATCAC TTTTGTTGAT AAACGGATTG AAAAGCTGTT GTTCAATTAT CGGGCACGCA ACTTCCCGGG GACGCTGGAT TATGCCGAGC AGCAACGCTG GCTGGAGCAC CGCCGCCAGG TCTTCACGCC AGAGTTTTTA CAAGGCTATG CAGAAGAGAT TCAGATGCTG GCGCAGCAGT ATGCCGACGA TAAAGAAAAA GTGGCGCTGT TAAAAGCGCT TTGGCAGTAC GCGGAAGAGA TTGTCTAA
|
Protein sequence | MMNDGKQQST FLFHDYETFG THPALDRPAQ FAAIRTDSEF NVIGEPEVFY CKPADDYLPQ PGAVLITGIT PQEARAKGEN EAAFAARIHS LFTVPKTCIL GYNNVRFDDE VTRNVFYRNF YDPYAWSWQH DNSRWDLLDV MRACYALRPE GINWPENDDG LPSFRLEHLT KANGIEHSNA HDAMADVYAT IAMAKLVKTR QPRLFDYLFT HRNKHKLMAL IDVPQMKPLV HVSGMFGAWR GNTSWVAPLA WHPENRNAVI MVDLAGDISP LLELDSDTLR ERLYTAKADL GDNAAVPVKL VHINKCPVLA QANTLRPEDA DRLGINRQHC LDNLKILREN PQVREKVVAI FAEAEPFTPS DNVDAQLYNG FFSDADRAAM KIVLETEPRN LPALDITFVD KRIEKLLFNY RARNFPGTLD YAEQQRWLEH RRQVFTPEFL QGYAEEIQML AQQYADDKEK VALLKALWQY AEEIV
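
The protein sequence corresponds to the gene sequence translated with translation table 11, and the GC content field is a base-composition percentage. The following is a minimal sketch of both calculations, not the method used to build this record. It assumes the 10-base spacing in the sequences is display formatting only, and it relies on table 11 sharing its codon-to-residue assignments with the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). All names are hypothetical.

```python
BASES = "TCAG"
# Residues in NCBI order (first, second, third base each cycling through T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGATGAATG ACGGTAAGCA ACAATCTACC"))  # MMNDGKQQST
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.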