Gene SbBS512_E4057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4057 
SymbolrfaQ 
ID6269922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3789891 
End bp3790949 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content47% 
IMG OID641727897 
Productlipopolysaccharide core biosynthesis protein 
Protein accessionYP_001882329 
Protein GI187733694 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02201] lipopolysaccharide heptosyltransferase III, putative 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.245519 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATAAGC TATTTCGAAG AATTTTGCTC ATTAAGATGC GTTTTCATGG GGATATGTTA 
TTAACTACTC CCGTCATTAG TTCGCTGAAA AAAAATTACC CTGACGCAAA AATCGATGTG
CTGCTTTATC AGGACACCAT CCCGATCCTG TCTGAAAATC CAGAGATTAA CGCGCTCTAC
GGCATAAAAA ATAAAAAAGC AAAAGCCTCA GAAAAAATTG CCAACTTTTT TCATCTCATC
AAGGTATTAC GTGCCAATAA GTATGACCTT ATCGTCAATC TCACCGATCA ATGGATGGTT
GCTATACTGG TTCGCTTATT AAATGCCCGT GTGAAAATTT CCCAGGATTA TCATCATCGG
CAGTCTGCTT TTTGGCGTAA AAGTTTCACC CATTTGGTGC CGTTGCAGGG TGGAAATGTG
GTGGAAAGTA ACTTATCCGT GCTGACCCCA TTGGGAGTTG ATTCGTTGGT GAAGCAGACA
ACCATGAGTT ACCCGCCTGC AAGCTGGAAA CGTATGCGTC GCGAACTTGA TCACGCTGGT
GTTGGACAAA ATTATGTGGT TATCCAACCT ACGGCGCGGC AAATCTTCAA ATGCTGGGAC
AACGCCAAGT TTTCCGCTGT GATTGATGCC TTACATGTTC GTGGTTATGA AGTTGTTCTG
ACGTCCGGCC CAGATAAAGA CGATCTGGCC TGCGTCAATG AAATTGCGCA GGGATGCCAG
ACGCCACCAG TAACGGCGCT GGCTGGAAAG GTGACCTTCC CGGAACTTGG TGCGTTAATC
GATCATGCGC AGCTGTTTAT TGGCGTTGAT TCCGCACCGG CGCATATTGC CGCTGCAGTT
AATACGCCGC TGATATCGCT GTTTGGTGCG ACAGACCATA TTTTCTGGCG TCCCTGGTCA
AATAACATGA TTCAATTCTG GGCGGGAGAT TACCGGGAAA TGCCAACGCG CGATCAGCGT
GACCGAAATG AGATGTATCT TTCGGTTATT CCGGCGGCAG ATGTCATTGC TGCTGTCGAT
AAATTACTGC CCTCCTCCAC GACAGGTACG TCGTTATGA
 
Protein sequence
MDKLFRRILL IKMRFHGDML LTTPVISSLK KNYPDAKIDV LLYQDTIPIL SENPEINALY 
GIKNKKAKAS EKIANFFHLI KVLRANKYDL IVNLTDQWMV AILVRLLNAR VKISQDYHHR
QSAFWRKSFT HLVPLQGGNV VESNLSVLTP LGVDSLVKQT TMSYPPASWK RMRRELDHAG
VGQNYVVIQP TARQIFKCWD NAKFSAVIDA LHVRGYEVVL TSGPDKDDLA CVNEIAQGCQ
TPPVTALAGK VTFPELGALI DHAQLFIGVD SAPAHIAAAV NTPLISLFGA TDHIFWRPWS
NNMIQFWAGD YREMPTRDQR DRNEMYLSVI PAADVIAAVD KLLPSSTTGT SL