Gene SbBS512_E2192 details

Gene Information

Locus tag: SbBS512_E2192
Symbol:
ID: 6271489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1992573
End bp: 1993961
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 54%
IMG OID: 641726217
Product: YjhS
Protein accession: YP_001880705
Protein GI: 187733233
COG category:
COG ID:
TIGRFAM ID:
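
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one trailing stop codon.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but not in the protein.
    return gene_length, codons - 1


# Values copied from the Start bp and End bp fields of this record.
gene_length, protein_length = cds_lengths(1992573, 1993961)
print(gene_length, protein_length)  # 1389 462
```

Under those assumptions the result agrees with the Gene Length (1389 bp) and Protein Length (462 aa) fields.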


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00000609291
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTTTA AACACTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGATCT TGCGGAAAAG 
CTGACACATA AACTGAAAGA GGGCTGGCAG CCGTTTGGTA GTCCGGTGGC CATAACCCCT
TATACTCTGA TGCAGGCGAT TGCAGCAGAA GGTGCAGTAA TCAGCGCCAC CAGCGACCCG
GAGTATTACT TTGTTGTGGT TCTGGCAGGG CAGTCAAACG GCATGTCGTA TGGTGAAGGC
CTTCCGCTGC CGGGGACATA TGACCGTCCG GACCCGCGTA TTAAGCAACT GGCGCGTCGC
AGTACGGTGA CACCGGGCGG TGCAGCATGC AAATATAACG ACATCATTCC GGCGGACCAT
TGTCTGCATG ATGTGCAGGA CATGAGCCGT CTTAACCATC CGAAAGCGGA CCTGTCAAAG
GGGCAGTACG GAACCGTGGG GCAGGGGCTG CATATCGCCA AAAAACTGCT GCCGTTTATA
CCGGCGAATG CGGGCATTCT GCTGGTTCCA TGCTGTCGTG GTGGTTCAGC GTTCACCACC
GGAGCTGATG GCACATACAG TGACGCGAGT GGTGCCTCGG AGAATTCAAC CCGCTGGGGT
GTGGACAAGC CGCTGTATAA GGACCTTATC GGTCGAACAA AAGCGGCACT GGAGAAGAAC
CCGAAAAATG TGCTGTTTGC CGTGGTGTGG ATGCAGGGGG AATTTGATTT TGGCGGTACG
CCGGCAAATC ATGCCGCACA GTTTGGTGCG CTGGTTGATA AATTCCGTGC AGACCTGGCG
GATATGGCAG GCCAGTGCGT CGGTGGCTCT GCTGGCGGTG TTCCCTGGAT ATGCGGGGAC
ACGACGTATT TCTGGAAGCA GAAGAACGAA TCCACGTACC AGACGGTGTA CGGCAGCTAT
AAAAATAAAA CGGAAAAGAA TATCCATTTC GTACCGTTCA TGACCGATGA GAACGGGGTG
AATGTGCCGA CGAACAAACC GGAAGAAGAC CCGGACATTC CGGGTATCGG TTATTACGGT
TCGAAATGGC GTGACAGCTC AGCCACCTGG ACGTCACAGG ACAGGGCGAG CCATTTCAGT
TCATGGGCTC GCCGTGGGAT TATTTCCGAC CGTCTGGCAA CGGCGATTCT GAGCTGCGCG
GGTAAGTCTT CTGCGTTTGT TAATGGTACT GCCGGGGTGG TTGTTCCAGA CAGGCCGGTT
ACCACCTCAG AGTCTGTAAT TTTTTACGAT GCCAAAAAAG CTACAGACAA TCAGCTGAAA
CCTTATGGCT GGGACGGTAT GTATGGCAGA CGCACACTGG TTGATGACAG CGGCAATAAA
GCTCTGCGAA TTGAGAAAAA TAACAGCTCG AAATCCTGGT CAATGTACTG TGAGGTGTAC
TGGCAATAG
 
Protein sequence
MAFKHYDVVR AASPSDLAEK LTHKLKEGWQ PFGSPVAITP YTLMQAIAAE GAVISATSDP 
EYYFVVVLAG QSNGMSYGEG LPLPGTYDRP DPRIKQLARR STVTPGGAAC KYNDIIPADH
CLHDVQDMSR LNHPKADLSK GQYGTVGQGL HIAKKLLPFI PANAGILLVP CCRGGSAFTT
GADGTYSDAS GASENSTRWG VDKPLYKDLI GRTKAALEKN PKNVLFAVVW MQGEFDFGGT
PANHAAQFGA LVDKFRADLA DMAGQCVGGS AGGVPWICGD TTYFWKQKNE STYQTVYGSY
KNKTEKNIHF VPFMTDENGV NVPTNKPEED PDIPGIGYYG SKWRDSSATW TSQDRASHFS
SWARRGIISD RLATAILSCA GKSSAFVNGT AGVVVPDRPV TTSESVIFYD AKKATDNQLK
PYGWDGMYGR RTLVDDSGNK ALRIEKNNSS KSWSMYCEVY WQ
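
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last stretches of the gene. It assumes the usual NCBI layout of the codon table in TCAG order, whose amino acid assignments for translation table 11 are the same as for the standard code. The names `CODON_TABLE` and `translate` are illustrative, and alternative start codons are not handled (this gene starts with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (TTT, TTC, TTA, ...).
# '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCTTTTA AACACTATGA TGTTGTCAGG"))  # MAFKHYDVVR
print(translate("TGGCAATAG"))                         # WQ
```

The two outputs match the beginning (MAFKHYDVVR) and the end (WQ) of the protein sequence listed above.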