Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2342 |
Symbol | |
ID | 4886186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2281264 |
End bp | 2282490 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640132279 |
Product | hypothetical protein |
Protein accession | YP_001063336 |
Protein GI | 126443127 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCACT ATTGCGAATG GCCTGATTTG CCCGCGCGCG CCTTCCGCAA GGCGCTCGGG AAGAACCGAC CGGAAACGCT CGAAGGCGGC GGCAAGGGCG GTGATGCGCC TGCAGCCCCG GATCCGTATG CCGTTGCGAA TGCAACTACC CAGACGAACA ATCAGACAGC GCAATTCAAC AAGGCGCTGA ACCTCAACAA CTACTCGAAC CCGTTCGGTT CGCAACAGTC GACGCAGATC GGCACTGATC CCGCGACGGG CGCGCCGATC TACAACACGA ACATCACCGC GAGCGGTCCG CTGCAGAGTC TCATCAATTC GACGATGGGC TCCGCGGGGA ACGCCAATTC GACGGTCAAC AATGCTCTGT TCGGGCTAGG CGGCCTGACG GCGCGCTACG ACGCGCTGAA TGGCAAGCTC GGCGCACTGG CGGGGCAGAT CGACCCGAAC GCCGCGCAGC TTGCCGGGCA GCGCGGCCAG AACGCTGCAT ACGCCGCGCA AACGCAGTAT CTCGATCCGC GCTTCTCGCA GGGGCAAACA AGCCTCGAGT CTCAGCTCGC GAATCAGGGC CTCACGCCAG GCTCGCAGGC ATACGATAAC GCGATGAAGA ACTTCAACCT GTCGAAGAAT CAGGCATATA GCGACGCGGC GAATCAATCG ATCCTTACCG GGCAGCAGAT CGGGACGCAG ATGTTGCAGA ACGAGCTCGC CGCGGTGGGG ACGCAAGCAG GACTTGTCGG GCAGCAGGGG CAAAACCTCG GGCAGCAAGG CGCGCTTTAC GGCCAGCAGG CATCGCTCGC GCAACTTCCG TTCTCTCAGC TCGCGACGCT CGCCAGCCTC GTGCCTGGCA ATACGGGCAC GGCGCAATCG GCCTCGTCGC CCGCGAACAT CGCTCAAGCA TTCCAGAACC AGTACGCAGG CCAGCTCAAT CAGTACAACA CGGGCGTGGC ATCCGCGAAT TCGACCATGG GCGGCCTGTT CGGGCTCGGT AGTGCCGGAT TGATGGGTTT CCTTCTTTCG GATCGGCGCT CAAAAACCGA CATTCATGCG ATCGGGCCGG CCGGTGACGG CGTCAATTTC TACCGTTTCC GCTATCGCTG GGAGGCGCCC GGCACCGTTC GTCATGGCCT GATGGCCGAC GAGGTGAAGC GCGTGCGGCC GGATGCCGTC GTGCGACACC CGAGCGGCTA TGACCTCGTG AATTACAACC GGGCGCTGGA GGCCTGA
|
Protein sequence | MRHYCEWPDL PARAFRKALG KNRPETLEGG GKGGDAPAAP DPYAVANATT QTNNQTAQFN KALNLNNYSN PFGSQQSTQI GTDPATGAPI YNTNITASGP LQSLINSTMG SAGNANSTVN NALFGLGGLT ARYDALNGKL GALAGQIDPN AAQLAGQRGQ NAAYAAQTQY LDPRFSQGQT SLESQLANQG LTPGSQAYDN AMKNFNLSKN QAYSDAANQS ILTGQQIGTQ MLQNELAAVG TQAGLVGQQG QNLGQQGALY GQQASLAQLP FSQLATLASL VPGNTGTAQS ASSPANIAQA FQNQYAGQLN QYNTGVASAN STMGGLFGLG SAGLMGFLLS DRRSKTDIHA IGPAGDGVNF YRFRYRWEAP GTVRHGLMAD EVKRVRPDAV VRHPSGYDLV NYNRALEA
|
| |