Gene SbBS512_E2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2083 
SymbolmanX 
ID6271866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1894321 
End bp1895292 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content50% 
IMG OID641726120 
ProductPTS system, mannose-specific IIAB component 
Protein accessionYP_001880614 
Protein GI187731326 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2893] Phosphotransferase system, mannose/fructose-specific component IIA
[COG3444] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB 
TIGRFAM ID[TIGR00824] PTS system, mannose/fructose/sorbose family, IIA component
[TIGR00854] PTS system, mannose/fructose/sorbose family, IIB component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000444348 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCATTG CTATTGTTAT AGGCACACAT GGTTGGGCTG CAGAGCAGTT GCTTAAAACG 
GCAGAAATGC TGTTAGGCGA GCAGGAAAAC GTCGGCTGGA TCGATTTCGT TCCAGGTGAA
AATGCCGAAA CGCTGATTGA AAAGTACAAC GCTCAGTTGG CAAAACTCGA CACCACTAAA
GGCGTGCTGT TTCTCGTTGA TACATGGGGA GGCAGCCCGT TCAATGCTGC CAGCCGCATT
GTCGTCGACA AAGAGCATTA TGAAGTCATT GCAGGCGTTA ACATTCCAAT GCTCGTGGAA
ACGTTAATGG CTCGTGATGA TGACCCAAGC TTTGATGAAC TGGTTGCGCT GGCAGTAGAA
ACAGGCCGTG AAGGCGTGAA AGCACTGAAA GCCAAACCGG TTGAAAAAGC CGCGCCAGCA
CCCGCTGCCG CAGCACCAAA AGCGGCTCCA ACTCCGGCAA AACCAATGGG GCCAAACGAC
TACATGGTTA TTGGCCTTGC GCGTATCGAC GACCGTCTGA TTCACGGTCA GGTCGCCACC
CGCTGGACCA AAGAAACCAA TGTCTCCCGT ATTATTGTTG TTAGTGATGA AGTGGCTGCG
GATACCGTTC GTAAGACACT GCTCACCCAG GTTGCACCTC CGGGCGTAAC AGCACACGTA
GTTGATGTTG CCAAAATGAT TCGCGTCTAC AACAACCCGA AATATGCTGG CGAACGCGTA
ATGCTGTTAT TTACCAACCC AACAGATGTA GAGCGTCTCG TTGAAGGCGG CGTGAAAATC
ACCTCTGTTA ACGTCGGTGG TATGGCATTC CGTCAAGGTA AAACCCAGGT GAATAACGCG
GTTTCGGTTG ATGAAAAAGA TATCGAGGCG TTCAAGAAAC TGAATGCGCG CGGTATTGAG
CTGGAAGTCC GTAAGGTTTC CACCGATCCG AAACTGAAAA TGATGGATCT GATCAGCAAA
ATCGATAAGT AA
 
Protein sequence
MTIAIVIGTH GWAAEQLLKT AEMLLGEQEN VGWIDFVPGE NAETLIEKYN AQLAKLDTTK 
GVLFLVDTWG GSPFNAASRI VVDKEHYEVI AGVNIPMLVE TLMARDDDPS FDELVALAVE
TGREGVKALK AKPVEKAAPA PAAAAPKAAP TPAKPMGPND YMVIGLARID DRLIHGQVAT
RWTKETNVSR IIVVSDEVAA DTVRKTLLTQ VAPPGVTAHV VDVAKMIRVY NNPKYAGERV
MLLFTNPTDV ERLVEGGVKI TSVNVGGMAF RQGKTQVNNA VSVDEKDIEA FKKLNARGIE
LEVRKVSTDP KLKMMDLISK IDK