Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2238 |
Symbol | arnC |
ID | 4884073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2228296 |
End bp | 2229303 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640128166 |
Product | undecaprenyl-phosphate 4-deoxy-4-formamido-L-arabinose transferase |
Protein accession | YP_001059273 |
Protein GI | 126438968 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCACC CTGAAACACG CGCGACGCAT CCTGAAGTTT CGATCGTCAT CCCCGTGTAC AACGAGGAAG CGGGGCTCGC CGCGCTCTTC GCGCGGCTCT ACCCGGCGCT CGACGCGCTC GGCACGCCGT ACGAGGTGAT CCTCGTCAAC GACGGCAGCC GCGACCGCTC GGCCGCCCTC CTCGCCGATC AGTTCCGCGT GCGTCCGGAC ACGACGCGCG TCGTGCTGCT GAACGGCAAC TACGGCCAGC ACATGGCGAT CCTCGCGGGC TTCGAGCAGT CGCGCGGCGA GATCGTCATC ACGCTCGACG CCGATCTGCA GAACCCGCCG GAGGAAATCG GCAAGCTGAT CGCGAAGATG CGCGAAGGCT ACGACTACGT CGGCTCGATC CGGCTGCAGC GCCAGGACAG CCTGTTCCGC CGCAAGGCGT CGGCCGCGAT GAACCGGCTG CGCGAGCGCA TCACGCGCAT CAAGATGACC GACCAGGGCT GCATGCTGCG CGCGTACAGC CGCCACATCA TCGACACGAT CAACCGCTGC GGCGAGGTGA ACACGTTCAT CCCCGCGCTC GCGTACACGT TCGCGCAAAA CCCGACCGAA ATCGAGGTCG CGCACGAAGA GCGCTTCGCG GGCGAATCGA AATACTCGCT GTACAGCCTG ATCCGCCTGA ACTTCGATCT CGTCACGGGC TTCTCGGTCG TGCCGCTGCA ATGGCTGTCG TTCATCGGCG TGATCCTCTC GCTCGGCTCG GCCGCGCTCT TCGTGCTGCT CGTCGTGCGC CGCTTCATCG TCGGCGCGGA AGTGCAGGGC GTGTTCACGC TGTTCGCGAT CACGTTCTTC CTGCTCGGCG TGATCATCTT CGCGCTCGGC CTGCTTGGCG AATACATCGG ACGAATCTAC CAGCAGGTCC GCGCGCGGCC GCGCTATCTG ATCCACACCG TGCTCGAGGC GCGCGACGGC AAGCCCGGCG TCACGCTCAC CGCCGAGCGC CGCGAGGCCG CGCGATGA
|
Protein sequence | MTHPETRATH PEVSIVIPVY NEEAGLAALF ARLYPALDAL GTPYEVILVN DGSRDRSAAL LADQFRVRPD TTRVVLLNGN YGQHMAILAG FEQSRGEIVI TLDADLQNPP EEIGKLIAKM REGYDYVGSI RLQRQDSLFR RKASAAMNRL RERITRIKMT DQGCMLRAYS RHIIDTINRC GEVNTFIPAL AYTFAQNPTE IEVAHEERFA GESKYSLYSL IRLNFDLVTG FSVVPLQWLS FIGVILSLGS AALFVLLVVR RFIVGAEVQG VFTLFAITFF LLGVIIFALG LLGEYIGRIY QQVRARPRYL IHTVLEARDG KPGVTLTAER REAAR
|
| |