Gene BURPS1106A_3118 details

Gene Information

Locus tag: BURPS1106A_3118
Symbol: rfaC
ID: 4902997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3037968
End bp: 3039035
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 71%
IMG OID: 640136344
Product: lipopolysaccharide heptosyltransferase I
Protein accession: YP_001067356
Protein GI: 126452253
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02193] lipopolysaccharide heptosyltransferase I
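
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the last codon of the CDS is a stop codon. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 3037968
end_bp = 3039035
gene_length_bp = 1068
protein_length_aa = 355

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp. Assumption: the final codon is the stop
# codon, which contributes no amino acid to the protein.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # → 1068 356 355
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.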


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.145361
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCGGCCG CCCGGCGCGA TAAAATCCGT CCTTTCGGTC TGTGCCGGCC GCCGCCGGCC 
CTTTTTTTCA GCGTGCAAAA AATTCTGATC GTGCGCGTGT CGTCGCTCGG CGATGTCGTG
CATAACATGC CGGTGATCGC CGATATCCGC CGGCGTCACC CCGATGCGCA GATCGACTGG
CTCGTCGAGG AAGGCTTCGC CGATCTCGTG CGGCTCGTCG ACGGTGTGCG CGACGTGCTG
CCGTTCTCGC TGCGGCGCTG GCGCAAGCGC TTGAGCGCAT CGCAAACGTG GCGCGAGATC
CGCGCGTTCC GGCGGCGCCT CGCCGAGGAG CGCTACGACC TCGTGATCGA CTGCCAGGGG
CTCATCAAGA CCGCGTGGGT CGCGAGCTGG GCGCGCGGGC CGCTTGTCGG CCTCGGCAAC
CGCACCGACG GCGCCGGCTA CGAGTGGCCG GTGCGCTTCT TCTACGACAG GCGGGTGCCG
ATCGCGCCGC GCACGCACGT CGTCGAGCGC TCGCGGCAGC TCGTCGCGGC GGCGCTGGGA
GACCCCGCGC CGGCGCCCGG CGAGCCGATC GATTTCGGCC TCGACACGCA TGGCGCGGCG
CGCGCGCTCG CGGCGCTCGA TTTGAATCTG CCGGTGCCCT ACGTGGTATT CGTGCACGCG
ACCTCGCGCG CCGACAAGCA GTGGCCCGAC GAAGCGTGGA CCGGCCTCGG CGAGGCGCTC
GTGCGGCGCG GCGCGTCGCT CGTGCTGCCG TGGGGCAGCG ACGCCGAGCG CGCGACGAGC
GAGCGCCTCG CGAAGGCGTT CGGCGCGGCG GCGATCGTGC CGCCGAAGCT GTCGCTGCCC
GCGGTCGTCG GCCTCGTCGA CGGCGCGGCG GCGACGGTCG GCGTCGATAC CGGCCTCGTC
CACATCGCGG CGGCGCTCAA GCGTCCGACC GTCGAACTGT ACAATTTCGC GACAGCCTGG
CGCACGGGCG GCTACTGGTC GCCCAACGTC GTCAATCTCG GCACCGCCGG CGCGCCGCCG
TCCCTTTCGC AGGCGAAGGA CGCACTCGCG TCGTTCGGCC TCTTGTAA
 
Protein sequence
MSAARRDKIR PFGLCRPPPA LFFSVQKILI VRVSSLGDVV HNMPVIADIR RRHPDAQIDW 
LVEEGFADLV RLVDGVRDVL PFSLRRWRKR LSASQTWREI RAFRRRLAEE RYDLVIDCQG
LIKTAWVASW ARGPLVGLGN RTDGAGYEWP VRFFYDRRVP IAPRTHVVER SRQLVAAALG
DPAPAPGEPI DFGLDTHGAA RALAALDLNL PVPYVVFVHA TSRADKQWPD EAWTGLGEAL
VRRGASLVLP WGSDAERATS ERLAKAFGAA AIVPPKLSLP AVVGLVDGAA ATVGVDTGLV
HIAAALKRPT VELYNFATAW RTGGYWSPNV VNLGTAGAPP SLSQAKDALA SFGLL
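
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code, and that the first codon (GTG here) is an initiation codon read as methionine. The helper names clean, gc_content, and translate_cds are hypothetical, and only the first 60 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments, listed in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    residues = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is an initiation codon and is read as
        # methionine even when it is GTG rather than ATG.
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
fragment = "GTGTCGGCCG CCCGGCGCGA TAAAATCCGT CCTTTCGGTC TGTGCCGGCC GCCGCCGGCC"
print(translate_cds(fragment))  # → MSAARRDKIRPFGLCRPPPA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to gc_content would give a figure to compare against the GC content field in the gene information.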