Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1659 |
Symbol | |
ID | 4886894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1601242 |
End bp | 1602759 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640131598 |
Product | linear gramicidin synthetase subunit C |
Protein accession | YP_001062655 |
Protein GI | 126442769 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0625323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCTCAA CCAGCGTCAA TCGTCTGTTC ACTTCGCAAG CCCGCCTTGC GCCCGAAGCG CTCGCGCTCT CGAGCGGCGA CACGCGCCTC ACGTACGGCG AGCTCGAACG ATGCGCGAAC CACCTGGCCC GACGCCTCGT CGACAGCGGC GTGCGGCCGC GCGACCGGGT CCTGCTCTGC CTGCCGCGCT CGGTCGACGC GGTGATCGCG ATGCTCGCGA TCATGAAGAC CGGCGCGGCG TTCGTGCCGG TCGATCCCGC GTATTCCGAC GCGATCAAGC GCGGCTATGC GAGCGACAGC GGCGCGCGGC ACGCGCTCGC GCGCGCGGCC GACGCCGCGG CGTTTCGCGG CGGCGCGCTG GGCGTGATCG ACGCCGACGA TCTGTCGGCC GCACGCGATG ACGAGGGGCC CGAAGTCGAT GCGGGGCACG ACGGGGAAAC GCCGGTGTAC GTGATGTTCA CCTCCGGCAG CACCGGCCGG CCCAAGGGCG TGATCGTCGC GCACCGCGGC GTCGCGCGGC TCGTCAGGGA AACGAACTAT ATCCGGATCA CGCGCGAGGA CACGCTGCTG CTGCTCTCGC CGATCACGTT CGACGCATCG ACGTTCGAGA TCTGGGGGGC GCTGCTCAAC GGCGCGCGGC TTGCGATCTA CGAGGACGCC ACGTTCGATC CGAACGCCGT CAGCCGGCTC ATCGCGCGCG AGCAAGTAAG CGTGATGTGG CTCACCGCGG GGCTGTTCCA TCTGGTCGCG CGGCGCTTCA TCGGCATGCT GGCGGGGCTG CGCGTCGTGC TCGCGGGCGG CGACGTGCTG AGCGCCGCCG CGATCGGCGC GGTGTTCGAC GCGTTCCCGT CGATCACCGT CATCAACGGC TACGGCCCGA CCGAGAACAC GACGTTCACG TGCTGCCACG TGATGACGGC CGACCGGCGG CCGACCGGTA CGGTGCCGAT CGGCCGGCCG ATCGCGGGCA CCGACGTTCG CATTCTCGAC GCCGCGCTGC GCGAGGTGCC TGACGGCGAG GAAGGCGAGC TGTGCGCAAG CGGCCTCGGC GTCGCGCTCG GCTACCTGAA CGCGCCCGAC GCGACGCGCG CCGCGTTCGT CGACTGCCCG GCGACGGGCA GCCGGCTCTA TCGCACCGGC GACCGCGCAC GGCGCCGGGC GGACGGCGTG ATCGAGTTCC TCGGCCGCAG CGACCGGCTC GTGAAGATAC GCGGCTACCG CGTGTCGCTC GACGAGCTGC AATCCGTCCT CGCCGGCATT CCCGGCGTCG AGGAGGCGCT CGTCAAGGTA TCCGAAGAAG CGACCGGCGA GAAGCGCCTC AGCGCGATCG TCCAATCCGG CCGCGCCGAA CCGGACATGA AGGCCTACGT GCGCCGCGAA CTGGCCAAGC GCGTGCCGCC GTTCCAGATT CCCGACGACA TCCGGATTTT CCCGCACATC CCGCTCAACG CGAACGGCAA GCTCGACCGC CACCGGCTGC CGGCCAGCGA GACCTCGACC CTCGGAGAGA AGCCATGA
|
Protein sequence | MTSTSVNRLF TSQARLAPEA LALSSGDTRL TYGELERCAN HLARRLVDSG VRPRDRVLLC LPRSVDAVIA MLAIMKTGAA FVPVDPAYSD AIKRGYASDS GARHALARAA DAAAFRGGAL GVIDADDLSA ARDDEGPEVD AGHDGETPVY VMFTSGSTGR PKGVIVAHRG VARLVRETNY IRITREDTLL LLSPITFDAS TFEIWGALLN GARLAIYEDA TFDPNAVSRL IAREQVSVMW LTAGLFHLVA RRFIGMLAGL RVVLAGGDVL SAAAIGAVFD AFPSITVING YGPTENTTFT CCHVMTADRR PTGTVPIGRP IAGTDVRILD AALREVPDGE EGELCASGLG VALGYLNAPD ATRAAFVDCP ATGSRLYRTG DRARRRADGV IEFLGRSDRL VKIRGYRVSL DELQSVLAGI PGVEEALVKV SEEATGEKRL SAIVQSGRAE PDMKAYVRRE LAKRVPPFQI PDDIRIFPHI PLNANGKLDR HRLPASETST LGEKP
|
| |