Gene SbBS512_E3765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3765 
SymboldamX 
ID6270480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3490279 
End bp3491565 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content54% 
IMG OID641727628 
Producthypothetical protein 
Protein accessionYP_001882063 
Protein GI187731317 
COG category[S] Function unknown 
COG ID[COG3266] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.00105538 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAT TCAAACCAGA AGACGAGCTG AAACCCGATC CCAGCGATCG TCGTACTGGT 
CGTTCTCGTC AATCTTCTGA ACGTTCTGAG CGTACTGAAC GTGGCGAACC GCAGATCAAT
TTTGATGATA TTGAACTTGA TGACACTGAC GATCGCCGTC CGACTCGTGC GCAAAAAGAG
CGCAATGAGG AACCGGAAAT CGAAGAAGAA ATTGACGAAT CCGAAGATGA AACCGTGGAT
GAAGAGCGCG TAGAGCGTCG TCCGCGTAAG TGCAAAAAAG CAGCCAGTAA ACCCGCTTCT
CGTCAGTATA TGATGATGGG CGTCGGCATT CTGGTTCTAC TGCTGTTGAT CATCGGTATC
GGTTCTGCGC TAAAAGCCCC CTCGACCTCT TCCAGCGATC AAACCGCGTC TGGCGAGAAG
AGTATTGATC TTGCAGGCAA TGCGACCGAT CAGGCGAATG GGGTGCAGCC AGCGCCGGGA
ACCACGTCTG CGGAAAATAC TCAGCAGGAT GTTTCTCTGC CACCGATCTC TTCTACGCCG
ACTCAAGGGC AAACCCCGGC GGCAACGGAT GGTCAACAAC GTGTTGAAGT GCAGGGTGAC
CTGAACAATG CGCTGACTCA GCCACAAAAT CAGCAACAGT TGAACAATGT GGCGGTCAAT
TCCACATTGC CGACCGAACC AGCGACTGTC GCGCCTGTTC GCAATGGCAA TGCATCGCGT
GACACGGCGA AAACGCAAAC CGCTGAACGT CCGTCCACTA CGCGCCTAGC TCGTCAGCAG
GCGGTGATTG AACCGAAAAA ACCGCAAGCA ACCGTGAAAA CGGAGCCGAA GCCGGTAGCA
CAGACGCCGA AGCGTACTGA ACCAGCTGCC CCTGTGGCGA GCACGAAGGC ACCGGCTGCG
ACTTCTACGC CAGCACCAAA AGAGACGGCG ACTACGGCTC CAGTACAGAC GGCATCCCCG
GCGCAAACCA CGGCAACACC AGCCGCTGGA GGGAAGACCG CAGGTAATGT TGGTTCGTTG
AAATCGGCAC CGTCCAGCCA TTACACTCTG CAGCTGAGCA GTTCCTCTAA CTACGACAAC
CTGAACGGTT GGGCGAAGAA AGAGAATCTG AAAAACTACG TTGTCTATGA AACGACGCGT
AATGGTCAGC CGTGGTATGT CCTGGTTTCT GGCGTGTACG CTTCGAAAGA AGAGGCGAAA
AAAGCGGTAT CTACATTGCC AGCAGATGTT CAGGCCAAAA ACCCGTGGGC GAAACCGCTG
CGTCAGGTAC AGGCCGATCT GAAGTAA
 
Protein sequence
MDEFKPEDEL KPDPSDRRTG RSRQSSERSE RTERGEPQIN FDDIELDDTD DRRPTRAQKE 
RNEEPEIEEE IDESEDETVD EERVERRPRK CKKAASKPAS RQYMMMGVGI LVLLLLIIGI
GSALKAPSTS SSDQTASGEK SIDLAGNATD QANGVQPAPG TTSAENTQQD VSLPPISSTP
TQGQTPAATD GQQRVEVQGD LNNALTQPQN QQQLNNVAVN STLPTEPATV APVRNGNASR
DTAKTQTAER PSTTRLARQQ AVIEPKKPQA TVKTEPKPVA QTPKRTEPAA PVASTKAPAA
TSTPAPKETA TTAPVQTASP AQTTATPAAG GKTAGNVGSL KSAPSSHYTL QLSSSSNYDN
LNGWAKKENL KNYVVYETTR NGQPWYVLVS GVYASKEEAK KAVSTLPADV QAKNPWAKPL
RQVQADLK