Gene BURPS1106A_2276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2276 
SymbolarnC 
ID4900959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2261819 
End bp2262826 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content66% 
IMG OID640135505 
Productundecaprenyl-phosphate 4-deoxy-4-formamido-L-arabinose transferase 
Protein accessionYP_001066540 
Protein GI126454149 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCACC CTGAAACACG CGCGACGCAT CCTGAAGTTT CGATCGTCAT CCCCGTGTAC 
AACGAGGAAG CGGGGCTCGC CGCGCTCTTC GCGCGGCTCT ACCCGGCGCT CGACGCGCTC
GGCACGCCGT ACGAGGTGAT CCTCGTCAAC GACGGCAGCC GCGACCGCTC GGCCGCCCTC
CTCGCCGATC AGTTCCGCGT GCGTCCGGAC ACGACGCGCG TCGTGCTGCT GAACGGCAAC
TACGGCCAGC ACATGGCGAT CCTCGCGGGC TTCGAGCAGT CGCGCGGCGA GATCGTCATC
ACGCTCGACG CCGATCTGCA GAACCCGCCG GAGGAAATCG GCAAGCTGAT CGCGAAGATG
CGCGAAGGCT ACGACTACGT CGGCTCGATC CGGCTGCAGC GCCAGGACAG CCTGTTCCGC
CGCAAGGCGT CGGCCGCGAT GAACCGGCTG CGCGAGCGCA TCACGCGCAT CAAGATGACC
GACCAGGGCT GCATGCTGCG CGCGTACAGC CGCCACATCA TCGACACGAT CAACCGCTGC
GGCGAGGTGA ACACGTTCAT CCCCGCGCTC GCGTACACGT TCGCGCAAAA CCCGACCGAA
ATCGAGGTCG CGCACGAAGA GCGCTTCGCG GGCGAATCGA AATACTCGCT GTACAGCCTG
ATCCGCCTGA ACTTCGATCT CGTCACGGGC TTCTCGGTCG TGCCGCTGCA ATGGCTGTCG
TTCATCGGCG TGATCCTCTC GCTCGGCTCG GCCGCGCTCT TCGTGCTGCT CGTCGTGCGC
CGCTTCATCG TCGGCGCGGA AGTGCAGGGC GTGTTCACGC TGTTCGCGAT CACGTTCTTC
CTGCTCGGCG TGATCATCTT CGCGCTCGGC CTGCTCGGCG AATACATCGG ACGAATCTAC
CAGCAGGTCC GCGCGCGGCC GCGCTACCTG ATCCACACCG TGCTCGAGGC GCGCGACGGC
AAGCCCGGCG TCACGCTCGC CGCCGAGCGC CGCGAGGCCG CGCGATGA
 
Protein sequence
MTHPETRATH PEVSIVIPVY NEEAGLAALF ARLYPALDAL GTPYEVILVN DGSRDRSAAL 
LADQFRVRPD TTRVVLLNGN YGQHMAILAG FEQSRGEIVI TLDADLQNPP EEIGKLIAKM
REGYDYVGSI RLQRQDSLFR RKASAAMNRL RERITRIKMT DQGCMLRAYS RHIIDTINRC
GEVNTFIPAL AYTFAQNPTE IEVAHEERFA GESKYSLYSL IRLNFDLVTG FSVVPLQWLS
FIGVILSLGS AALFVLLVVR RFIVGAEVQG VFTLFAITFF LLGVIIFALG LLGEYIGRIY
QQVRARPRYL IHTVLEARDG KPGVTLAAER REAAR