Gene SbBS512_E4048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4048 
SymbolrfaC 
ID6273300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3780766 
End bp3781746 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content51% 
IMG OID641727888 
ProductADP-heptose:LPS heptosyl transferase I 
Protein accessionYP_001882320 
Protein GI187733685 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02193] lipopolysaccharide heptosyltransferase I 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0000904915 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGGTTT TGATCGTTAA AACATCGTCG ATGGGCGATG TTCTCCATAC GTTGCCCGCA 
CTCACTGATG CCCAGCAGGC AATCCCAGGG ATTAAGTTTG ACTGGGTGGT GGAAGAAGGG
TTCGCACAGA TTCCTTCCTG GCACGCTGCC GTTGAGCGAG TTATTCCTGT GGCAATACGT
CGCTGGCGTA AAGCCTGGTT CTCGGCCCCC ATAAAAGCTG AACGCAAAGC GTTTCGTGAA
GCGCTACAAG CAGAGAACTA TGACGCAGTT ATCGACGCTC AGGGGCTGGT AAAAAGCGCG
GCACTGGTGA CACGTCTGGC GCATGGCGTA AAGCATGGAT TGGACTGGCA AACCGCTCGC
GAACCTTTAG CCAGCCTGTT TTACAATTGT AAGCATCATA TTGCAAAACA GCAGCACGCC
GTAGAACGCA CCCGCGAACT GTTTGCCAAA AGTTTGGGCT ATAGCAAACC GCAAACCCAG
GGCGATTATG CTATCGCACA GCATTTTCTG ACGAACCTGC CTACAGATGC TGGCGAATAT
GCCGTATTTC TTCATGCGAC GACCCGTGAT GATAAACACT GGCCGGAAGA ACACTGGCGA
GAATTGATTG GTTTACTGGC TGATTCAGGA ATACGGATTA AACTTCCGTG GGGCGCGCCG
CATGAGGAAG AACGGGCGAA ACGACTGGCG GAAGGATTTG ATTATGTTGA AGTATTGCCG
AAGATGAGTC TGGAAGGCGT TGCCCGCGTG CTGGCCGGGG CTAAATTTGT AGTGTCGGTG
GATACGGGGT TAAGCCATTT AACGGCGGCA CTGGATAGAC CCAATATCAC GGTTTATGGA
CCAACCGATC CGGGATTAAT TGGTGGGTAT GGGAAGAATC AGATGGTTTG TAGGGCTCCG
GGGAATGAGT TGTCTCAATT GACAGCAAAT GCTGTTAAGC GATTCATTGA AGAAAACGCT
GAAAACGCTG CTATGATTTA A
 
Protein sequence
MRVLIVKTSS MGDVLHTLPA LTDAQQAIPG IKFDWVVEEG FAQIPSWHAA VERVIPVAIR 
RWRKAWFSAP IKAERKAFRE ALQAENYDAV IDAQGLVKSA ALVTRLAHGV KHGLDWQTAR
EPLASLFYNC KHHIAKQQHA VERTRELFAK SLGYSKPQTQ GDYAIAQHFL TNLPTDAGEY
AVFLHATTRD DKHWPEEHWR ELIGLLADSG IRIKLPWGAP HEEERAKRLA EGFDYVEVLP
KMSLEGVARV LAGAKFVVSV DTGLSHLTAA LDRPNITVYG PTDPGLIGGY GKNQMVCRAP
GNELSQLTAN AVKRFIEENA ENAAMI