Gene SbBS512_E2284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2284 
SymbolmdoC 
ID6268900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2076933 
End bp2078090 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content44% 
IMG OID641726298 
Productglucans biosynthesis protein 
Protein accessionYP_001880782 
Protein GI187733850 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.624965 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAG TACCCGCGCA ACGTGAATAT TTCCTCGACT CCATCCGCGC CTGGCTGATG 
TTGTTAGGGA TCCCTTTTCA TATTTCTTTA ATCTATTCGA GCCATACATG GCATGTGAAT
AGCGCCGAAC CGTCATTGTG GCTGACCCTT TTTAATGACT TCATCCACTC GTTCCGCATG
CTGGTATTTT TCGTTATATC CGGCTACTTT TCCTACATGC TTTTTTTACG CTATCCCTTG
AAAAAATGGT GGAAAGTACG TGTCGAACGT GTAGGTATCC CGATGTTAAC AGCCATCCCC
TTACTGACAT TGCCGCAATT TATTATGCTG CAATACGTCA AAGGGAAAGC GGAAAGTTGG
CCTGGGCTGT CATTGTATGA CAAATATAAT ACGTTGGCCT GGGAATTAAT ATCACACCTG
TGGTTTTTAC TGGTGTTAGT AGTCATGACG ACGCTGTGCG TATGGATATT TAAACGCATC
AGAAATAATT TAGAAAATTC TGATAAAACG AATAAAAAAT TCTCGATGGT AAAACTATCG
GTGATTTTTT TATGCCTCGG CATCGGCTAT GCGGTAATAA GAAGAACGAT TTTTATTGTG
TATCCGCCCA TTCTGAGTAA TGGCATGTTC AATTTTATTG TCATGCAAAC GCTATTTTAT
TTGCCGTTCT TTATCCTCGG TGCACTGGCT TTCATTTTCC CTCATCTTAA AGCCTTGTTT
ACCACGCCGT CTCGTGGCTG TACCCTTGCA GCAGCATTGG CATTTGTCGC TTATTTACTC
AACCAACGCT ATGGCAGTGG CGATGCCTGG ATGTACGAAA CCGAGTCGGT GATCACCATG
GTCCTCGGTC TGTGGATGGT GAATGTGGTC TTCTCCTTTG GCCACCGTTT GCTTAACTTC
CAGTCAGCGC GGGTGACTTA CTTTGTTAAC GCATCGCTAT TTATCTATCT GGTTCACCAC
CCGTTAACGC TGTTTTTCGG CGCATACATT ACACCGCACA TCACCTCCAA CTGGCTTGGT
TTTCTCTGTG GCCTGATATT CGTAGTAGGG ATTGCGATAA TTCTGTATGA AATTCATTTA
CGCATCCCGT TACTGAAGTT TTTGTTTTCT GGTAAACCGG TTGTTAAGCG TGAGAACGAT
AAAGCACCAG CCCGTTAA
 
Protein sequence
MNPVPAQREY FLDSIRAWLM LLGIPFHISL IYSSHTWHVN SAEPSLWLTL FNDFIHSFRM 
LVFFVISGYF SYMLFLRYPL KKWWKVRVER VGIPMLTAIP LLTLPQFIML QYVKGKAESW
PGLSLYDKYN TLAWELISHL WFLLVLVVMT TLCVWIFKRI RNNLENSDKT NKKFSMVKLS
VIFLCLGIGY AVIRRTIFIV YPPILSNGMF NFIVMQTLFY LPFFILGALA FIFPHLKALF
TTPSRGCTLA AALAFVAYLL NQRYGSGDAW MYETESVITM VLGLWMVNVV FSFGHRLLNF
QSARVTYFVN ASLFIYLVHH PLTLFFGAYI TPHITSNWLG FLCGLIFVVG IAIILYEIHL
RIPLLKFLFS GKPVVKREND KAPAR