Gene SbBS512_A0261 details


Gene Information

Locus tag: SbBS512_A0261
Symbol:
ID: 6273458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010660
Strand:
Start bp: 175200
End bp: 176288
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 43%
IMG OID: 641728882
Product: putative glycosyl transferase, group 1 family protein
Protein accession: YP_001883273
Protein GI: 187734295
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
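
The start, end, gene length and protein length listed above are consistent with each other if the coordinates are read as 1-based and inclusive. The Python sketch below reproduces that arithmetic. The function names are illustrative and not part of the database. It assumes the gene length includes the stop codon and the protein length does not.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(175200, 176288)  # values from the record above
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1089 362
```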


Plasmid Coverage information

Num covering plasmid clones: 144
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATAC TATTTACGGA ATCATCACCG AATATAGGTG GTCAGGAATT ACAGGCTGTT 
GCTCAAATGA AGGCCCTGAA GAAAATGGGG CATTCAGTTC TGCTTGTCTG CAGGGAAAAC
AGCAAAATTG CTTTTGAAGC CAGTAAATTG GGAATTGATA TCACATTCGC GTTATTTCGA
AACAGTCTTC ACATCCCTAC TGCATGGAGA TTACTCGGAA TAGTTCATGG TTTTCAGCCC
AATGCAATCG TTTGTCACAG TGGGCATGAT AGCAATATTG TTGGTTTAGT ACGGTTATTC
ACTTGGAAAC ATCCATTCAG AATTATCAGG CAAAAGACAT ATTTGACACG AAAAACAAAA
GTTTTTTCAA TAAATCATTT TTGCGATGAG GTGATTGTTC CCGGAACAAG TATGAAGACA
CATCTGGAGC AGGAAGGATG TCGAACCCGG GTTACTGTTG TGCCTCCAGG CTTTGATTTC
CAGAAATTAT ACGTTGATTC CCGAAACAGT TTGCCTCCAA ATGTTCTTTC TTGGCTGGCG
TCCCGAAGGG GATGCCCCGT TATTGCTCAG GTAGGAATGT TGCGCCCGGA AAAAGGGCAC
GAATTTATGT TGAATTTACT GTTCCATTTA AAAATGAATG GACGACAGTT CTGTTGGTTG
ATTGTGGGGT CTGGTTCGCC TGAACTGCGG GAGCATTTAC AGTATCAGAT TGACAGTATG
GGCATGCATG ATGATGTTTT TATTGCTGAC AATGTTTTTC CTGCCGCCCC CGTATATCGG
GTTGCCAGTC TGGTGGTTCT GCCTTCAGAA AACGAATCGT TTGGTATGGT ACTGGCAGAA
GCATCGGCAT TTTCTGTGCC TGTACTGGCC AGTCAGATTG GTGGAATCCC TGATGTTATT
CAGAACAACC AGACCGGGAC ATTGTTACCA GCAGGTAATA AGCACGCATG GATGTGCGCC
CTGAATGATT TTTTTAATGA CCCTGGGCGT TTTTATCAGA TGGCTCGCCA GGCAAAACAG
GATATAGAAG AGCGGTTTGA TATTAATAAA ACTGCGTTAA AAATACTCAC ATTAGCGAAG
CACAAGTAA
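
The gene sequence is printed in blocks of ten bases, so the whitespace has to be stripped before any computation. The Python sketch below computes GC content under one common definition: G plus C, divided by the total number of bases. The record does not say how its 43% figure was derived, so that definition is an assumption, and the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block of the gene sequence as a small example; paste the full
# sequence here to compare the result with the GC content in the record.
print(gc_content("ATGAATATAC"))  # 0.2
```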
 
Protein sequence
MNILFTESSP NIGGQELQAV AQMKALKKMG HSVLLVCREN SKIAFEASKL GIDITFALFR 
NSLHIPTAWR LLGIVHGFQP NAIVCHSGHD SNIVGLVRLF TWKHPFRIIR QKTYLTRKTK
VFSINHFCDE VIVPGTSMKT HLEQEGCRTR VTVVPPGFDF QKLYVDSRNS LPPNVLSWLA
SRRGCPVIAQ VGMLRPEKGH EFMLNLLFHL KMNGRQFCWL IVGSGSPELR EHLQYQIDSM
GMHDDVFIAD NVFPAAPVYR VASLVVLPSE NESFGMVLAE ASAFSVPVLA SQIGGIPDVI
QNNQTGTLLP AGNKHAWMCA LNDFFNDPGR FYQMARQAKQ DIEERFDINK TALKILTLAK
HK
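
The protein sequence can be checked against the gene sequence by translating it codon by codon. The Python sketch below is not the database's own pipeline. It builds the standard codon-to-amino-acid mapping, which translation table 11 shares for elongation codons (table 11 differs from the standard table only in the start codons it allows), and it stops at the first stop codon. The function and variable names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(raw: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAATATAC TATTTACGGA ATCATCACCG"))  # MNILFTESSP
```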