Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3416 |
Symbol | |
ID | 4899709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3332353 |
End bp | 3333468 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640136642 |
Product | hypothetical protein |
Protein accession | YP_001067653 |
Protein GI | 126454421 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.630747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCTC GGAGTGCCGG GAAGGTTCAA GAAGCTCAAG AAGCTCAAGA AGCGCGGGAC GCGACGCGCC GCTGTGCGCA GCGGCCGACA GGCGCGCATG ACGCGACCGA TCACCGGCGC GCGGACCCGG CGACAACCGG TTTTGCCGAC CGGGCCGACG GATCGTGCGG CGCGGGCGAT CCGCGCGGCG CTTGCGGGCC GGCGCACCCC GAGCACGCGC GGCGCTCGGC CGCCGGCGGC GGCCCTGACG CACCCCGCGC GTTTGCTGTT TCGATGCGCC GACGCGGATC GTCCTGCGCC GATGGCGGAG CCTGTTTCGC ACGAATCGAT TGGTCCGCGC CATGGCTCGC CCCGCTCGCC GATCGCGGCG AACGATGGAC GCACGCGGCG CAACGGGGCG AGGCGGCATG GCTGCGCATG CTGAACGACG AGGCCCGAGC CGAGCGGCTC GCGACGGGGC GCGGCTTGCC GCTTCGCTTC ATCGCGCAGG CGGCGTTGCC GGCGGGCATC GCGTACGAGA CCCACATCGC CGAAACGGGC GCCGTGCCGA CCCGCCACAA TCTGCATGAT TTCTTCAATG CGCTCGTCTG GTTCGCGTAT CCGCGCATCA AGGCGGCGCT CAACGCGCGC CAGGCCGCGG CGATCGACGC GGCGGGCGTC GGCGCCGTGC GCGGCGGCGT TCGCGACGCC CTCACGCTGC TCGACGAGAA CGGCGCGCTC TTCGCGACGT CGGATCCGGC GCTGGCCGCC GCACTGCGCG GCTTCGACTG GCCGACGCTG ATGCGCGCGT CGCGCGATGC ATGGGGCGCG CGTTGCGATG CGCGGATCGT CGGCCATGCG CTCTGCGAAA AGCTCGTCGA TCCGTACAAG GGCTGCACCG CGCACGCATG GATCGTCGAG GTGCCGGCCG CGTATTTCGA CTGGCCCGAC GCGCGGCGCC GCGCCTGGCT TGACGAGCGC GTGGCCGCGG CACTCGCCGC GACCGATCCG GCAAGCCGCG GCTTCGCGCC GCTGCCGGTG CTGGGCGTGC CCGGCTGGTG GCCGGCGAAC GCGTCGCCGG CCTTCTACGA CGATCCGCAG GTGTTTCGCC GCGGCCGCCG CGCGCGAGCG GAGTGA
|
Protein sequence | MSARSAGKVQ EAQEAQEARD ATRRCAQRPT GAHDATDHRR ADPATTGFAD RADGSCGAGD PRGACGPAHP EHARRSAAGG GPDAPRAFAV SMRRRGSSCA DGGACFARID WSAPWLAPLA DRGERWTHAA QRGEAAWLRM LNDEARAERL ATGRGLPLRF IAQAALPAGI AYETHIAETG AVPTRHNLHD FFNALVWFAY PRIKAALNAR QAAAIDAAGV GAVRGGVRDA LTLLDENGAL FATSDPALAA ALRGFDWPTL MRASRDAWGA RCDARIVGHA LCEKLVDPYK GCTAHAWIVE VPAAYFDWPD ARRRAWLDER VAAALAATDP ASRGFAPLPV LGVPGWWPAN ASPAFYDDPQ VFRRGRRARA E
|
| |