Gene BURPS668_2238 details

Gene Information

Locus tag: BURPS668_2238
Symbol: arnC
ID: 4884073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2228296
End bp: 2229303
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 66%
IMG OID: 640128166
Product: undecaprenyl-phosphate 4-deoxy-4-formamido-L-arabinose transferase
Protein accession: YP_001059273
Protein GI: 126438968
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
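
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2228296
END_BP = 2229303


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1008
print(protein_length(length_bp))  # 335
```

Both results agree with the Gene Length and Protein Length fields listed in the record.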


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCACC CTGAAACACG CGCGACGCAT CCTGAAGTTT CGATCGTCAT CCCCGTGTAC 
AACGAGGAAG CGGGGCTCGC CGCGCTCTTC GCGCGGCTCT ACCCGGCGCT CGACGCGCTC
GGCACGCCGT ACGAGGTGAT CCTCGTCAAC GACGGCAGCC GCGACCGCTC GGCCGCCCTC
CTCGCCGATC AGTTCCGCGT GCGTCCGGAC ACGACGCGCG TCGTGCTGCT GAACGGCAAC
TACGGCCAGC ACATGGCGAT CCTCGCGGGC TTCGAGCAGT CGCGCGGCGA GATCGTCATC
ACGCTCGACG CCGATCTGCA GAACCCGCCG GAGGAAATCG GCAAGCTGAT CGCGAAGATG
CGCGAAGGCT ACGACTACGT CGGCTCGATC CGGCTGCAGC GCCAGGACAG CCTGTTCCGC
CGCAAGGCGT CGGCCGCGAT GAACCGGCTG CGCGAGCGCA TCACGCGCAT CAAGATGACC
GACCAGGGCT GCATGCTGCG CGCGTACAGC CGCCACATCA TCGACACGAT CAACCGCTGC
GGCGAGGTGA ACACGTTCAT CCCCGCGCTC GCGTACACGT TCGCGCAAAA CCCGACCGAA
ATCGAGGTCG CGCACGAAGA GCGCTTCGCG GGCGAATCGA AATACTCGCT GTACAGCCTG
ATCCGCCTGA ACTTCGATCT CGTCACGGGC TTCTCGGTCG TGCCGCTGCA ATGGCTGTCG
TTCATCGGCG TGATCCTCTC GCTCGGCTCG GCCGCGCTCT TCGTGCTGCT CGTCGTGCGC
CGCTTCATCG TCGGCGCGGA AGTGCAGGGC GTGTTCACGC TGTTCGCGAT CACGTTCTTC
CTGCTCGGCG TGATCATCTT CGCGCTCGGC CTGCTTGGCG AATACATCGG ACGAATCTAC
CAGCAGGTCC GCGCGCGGCC GCGCTATCTG ATCCACACCG TGCTCGAGGC GCGCGACGGC
AAGCCCGGCG TCACGCTCAC CGCCGAGCGC CGCGAGGCCG CGCGATGA
 
Protein sequence
MTHPETRATH PEVSIVIPVY NEEAGLAALF ARLYPALDAL GTPYEVILVN DGSRDRSAAL 
LADQFRVRPD TTRVVLLNGN YGQHMAILAG FEQSRGEIVI TLDADLQNPP EEIGKLIAKM
REGYDYVGSI RLQRQDSLFR RKASAAMNRL RERITRIKMT DQGCMLRAYS RHIIDTINRC
GEVNTFIPAL AYTFAQNPTE IEVAHEERFA GESKYSLYSL IRLNFDLVTG FSVVPLQWLS
FIGVILSLGS AALFVLLVVR RFIVGAEVQG VFTLFAITFF LLGVIIFALG LLGEYIGRIY
QQVRARPRYL IHTVLEARDG KPGVTLTAER REAAR
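
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a hedged illustration of how the two relate, the sketch below strips that layout, computes GC content, and translates codons into residues. It assumes the codon-to-residue assignments of translation table 11, which match the standard genetic code for elongation codons, and it does not handle the alternative start codons that table 11 permits (this record starts with ATG, so that case does not arise). The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGACTCACC CTGAAACACG CGCGACGCAT"))  # MTHPETRATH
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the protein sequence and the GC content field of the record.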