Gene SbBS512_E1221 details

Gene Information

Locus tag: SbBS512_E1221
Symbol: sbcB
ID: 6268606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1127192
End bp: 1128619
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 50%
IMG OID: 641725351
Product: exonuclease I
Protein accession: YP_001879865
Protein GI: 187733588
COG category: [L] Replication, recombination and repair
COG ID: [COG2925] Exonuclease I
TIGRFAM ID:
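
The length fields above are related by simple arithmetic. A minimal sketch, assuming the Start bp and End bp coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the variable names are illustrative and are not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 1127192
end_bp = 1128619

# Inclusive coordinates span (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.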


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.171451
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGC 
ACGCACCCCG CGTTAGATCG CCCTGCACAG TTCGCAGCCA TTCGCACCGA TAGCGAATTC
AATGTCATCG GCGAACCCGA AGTCTTTTAC TGCAAGCCCG CTGATGACTA TTTACCCCAG
CCAGGAGCCG TATTAATTAC CGGTATTACC CCGCAGGAAG CACGGGCGAA AGGAGAAAAC
GAAGCCGCGT TTGCCGCCCG TATTCACTCG CTTTTTACCG TACCGAAGAC CTGTATTCTG
GGCTACAACA ATGTGCGTTT CGACGACGAA GTCACACGCA ACATTTTTTA TCGTAATTTC
TACGATCCTT ACGCCTGGAG CTGGCAGCAT GATAACTCGC GCTGGGATTT ACTGGATGTT
ATGCGTGCCT GTTATGCCCT GCGCCCGGAA GGAATAAACT GGCCTGAAAA TGATGACGGT
CTACCGAGCT TTCGCCTTGA GCATTTAACC AAAGCGAATG GTATTGAACA TAGCAACGCC
CACGATGCGA TGGCTGATGT GTACGCCACT ATTGCGATGG CGAAACTGGT AAAAACGCGT
CAGCCACGTC TGTTTGATTA TCTCTTTACC CATCGTAATA AACACAAACT GATGGCGTTG
ATTGATGTTC CGCAGATGAA ACCCCTGGTT CACGTTTCCG GAATGTTTGG GGCATGGCGC
GGCAATACCA GCTGGGTGGC ACCGCTGGCG TGGCATCCAG AAAATCGCAA TGCCGTAATT
ATGGTGGATT TGGCAGGAGA CATTTCGCCA TTACTGGAAC TGGATAGCGA CACATTGCGC
GAGCGTTTAT ATACTGCAAA AGCCGATCTT GGCGATAACG CCGCCGTTCC GGTTAAGCTG
GTGCATATCA ATAAATGTCC GGTGCTGGCC CAGGCGAATA CGCTACGCCC GGAAGATGCC
GACCGACTGG GAATTAATCG TCAGCATTGC CTCGATAACC TGAAAATTCT GCGTGAAAAT
CCGCAAGTGC GCGAAAAAGT GGTGGCGATA TTCGCGGAAG CCGAACCGTT TACGCCTTCA
GATAACGTGG ATGCACAGCT TTATAACGGC TTTTTCAGTG ACGCAGATCG TGCAGCAATG
AAAATTGTGC TGGAAACCGA GCCGCGTAAT TTACCGGCAC TGGATATCAC TTTTGTTGAT
AAACGGATTG AAAAGCTGTT GTTCAATTAT CGGGCACGCA ACTTCCCGGG GACGCTGGAT
TATGCCGAGC AGCAACGCTG GCTGGAGCAC CGCCGCCAGG TATTCACGCC AGAGTTTTTG
CAGGGTTATG CTGATGAATT GCAGATGCTG GTACAACAAT ATGCCGATGA CAAAGAGAAA
GTGGCGCTGT TAAAAGCACT TTGGCAGTAC GCGGAAGAGA TCGTCTAA
 
Protein sequence
MMNDGKQQST FLFHDYETFG THPALDRPAQ FAAIRTDSEF NVIGEPEVFY CKPADDYLPQ 
PGAVLITGIT PQEARAKGEN EAAFAARIHS LFTVPKTCIL GYNNVRFDDE VTRNIFYRNF
YDPYAWSWQH DNSRWDLLDV MRACYALRPE GINWPENDDG LPSFRLEHLT KANGIEHSNA
HDAMADVYAT IAMAKLVKTR QPRLFDYLFT HRNKHKLMAL IDVPQMKPLV HVSGMFGAWR
GNTSWVAPLA WHPENRNAVI MVDLAGDISP LLELDSDTLR ERLYTAKADL GDNAAVPVKL
VHINKCPVLA QANTLRPEDA DRLGINRQHC LDNLKILREN PQVREKVVAI FAEAEPFTPS
DNVDAQLYNG FFSDADRAAM KIVLETEPRN LPALDITFVD KRIEKLLFNY RARNFPGTLD
YAEQQRWLEH RRQVFTPEFL QGYADELQML VQQYADDKEK VALLKALWQY AEEIV
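
The gene and protein sequences above are related by translation table 11. A minimal sketch of that translation in plain Python, assuming the codon-to-amino-acid assignments of table 11 match the standard code (the function names `translate` and `gc_fraction` are illustrative; alternative start codons, which table 11 also permits, are not handled because this gene begins with ATG):

```python
from itertools import product

# Codon table in the conventional TCAG order. The amino acid assignments
# used here are those of the standard code, which table 11 is assumed to share.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record (60 bases).
first_line = "ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGC"
print(translate(first_line))  # MMNDGKQQSTFLFHDYETFG
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence listed above, and `gc_fraction` gives a way to check the reported GC content.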