Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_4069 |
Symbol | |
ID | 4901849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3973857 |
End bp | 3975542 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640137295 |
Product | GMC family oxidoreductase |
Protein accession | YP_001068288 |
Protein GI | 126454463 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACTACCG AACGTACGCT CGAAGGCGAA TTCGATTATG TGATCGTCGG CGCCGGCACG GCGGGCTGCG TGCTCGCGAA CCGGCTCACC GAGGATCCGG ACGTGACCGT GCTGCTGCTC GAAGCCGGCG GCAGGGACGA CTATCACTGG ATCCACATCC CGGTCGGCTA TCTGTATTGC ATCGGCAATC CGCGCACCGA CTGGCTCTAC AAGACCGAGC CCGAAGCGGG CCTGAACGGC CGCGCGCTGT CGTATCCGCG CGGGCGCGTG CTGGGCGGCT CGTCGTCGAT CAACGGAATG ATCTACATGC GCGGCCAGCG CGGCGATTAC GACGACTGGG CGCGCGCCAC GGGCGACGCG GGCTGGTCGT GGGACAGTGT GCTGCCCGTC TTCAGGCGCA GCGAGGATCA TCATGCGGGC GCGACCGACA TGCACGGCGC GGGCGGCATG TGGCGCGTCG AGAAGCAGCG GCTGCGCTGG GAGATTCTCG AGGCATTCTC GCAGGCCGCG CAGCAGACGG GCATTCCGGC CACCGACGAT TTCAACCGCG GCGACAACAC GGGCGTCGGC TATTTCGAAG TCAATCAGAA GCGCGGCATT CGCTGGAACG CGTCGAAGGC GTTCTTGCGC CCTGCGCTCG CGCGGCCGAA CCTCACGGTG ATCACCGGCG CGCAGGCCGA GCGGCTCGTG TTCGACGGCA AGCGCTGCGC GGGGGTCGAA TATCGCGGCG GCGGCGCGCC GTTCGTCGCG TGCGCGCGCG TCGAGGTGCT CGTTGCGTCG GGCGCGGTGA ATTCGCCGCA GTTGCTCGAA TTGTCGGGCA TCGGCGACGG CAGCCGGCTG CAGGCGCTCG GCATCGGCGT CATCGCGGAT CTGCGCGGCG TCGGCGAAAA TCTTCAGGAT CACTTGCAGT TGCGCATGGC GTTTCGCGTG CGCGGCGTGC GCACGCTGAA CACGCTGTCC GCGCACTGGT GGGGCAAGCT GTGGATCGGC GCGCAATACG CGCTGATGCA GCGCGGGCCG ATGTCGATGG CGCCGTCGCA ATTGGGCGCG TTCGCGAAAT CGGACCCGAA CGATCCGGCA CTCGCGCGGC CCGATCTCGA ATATCACGTG CAGCCGCTGT CGCTCGAGCG CTTCGGCGAG CCGCTGCATC GCTTCAACGC GTTCACCGCG TCGGTCTGCC ATCTGCGGCC GACGTCGCGC GGCAGCGTCC ATGCGGCGTC GCCGGATCCG GCGCGCGCGC CGTCGATTGC GCCGAACTAT CTGTCGACCG ATTACGATCG CCATGTCGCG GCGAACGCGC TGCGCCTGAC GCGCCGGATC GCCTCGGCGC CCGCGCTCGC GCGCTATGCA CCCGAAGAAA TCCTGCCGGG CGCCCGGTAT GTGAGCGAAG CGGAGCTGAT CGCCGCGGCG GGCGCCGTGG GCACGACGAT TTTCCATCCG GTCGGCACGT GCCGGATGGG GCGCGCCGAC GATCCGGACG CCGTCGTCGA TAGCCGCCTG CGCGTGCGCG GCGTGACGGG GCTGCGGGTC GTCGACGCGT CGGTGATGCC GACGATCACG TCGGGCAACA CGAACTCCCC GACGCTGATG ATCGCCGAGC GCGCGAGCGA CATGATTCGC GCGGATCGCC GCGGCGCATC GGAGCGCGGC GCGAGCGCGC GCGCCGAAGC CGTGCTGCCG ACGTAA
|
Protein sequence | MTTERTLEGE FDYVIVGAGT AGCVLANRLT EDPDVTVLLL EAGGRDDYHW IHIPVGYLYC IGNPRTDWLY KTEPEAGLNG RALSYPRGRV LGGSSSINGM IYMRGQRGDY DDWARATGDA GWSWDSVLPV FRRSEDHHAG ATDMHGAGGM WRVEKQRLRW EILEAFSQAA QQTGIPATDD FNRGDNTGVG YFEVNQKRGI RWNASKAFLR PALARPNLTV ITGAQAERLV FDGKRCAGVE YRGGGAPFVA CARVEVLVAS GAVNSPQLLE LSGIGDGSRL QALGIGVIAD LRGVGENLQD HLQLRMAFRV RGVRTLNTLS AHWWGKLWIG AQYALMQRGP MSMAPSQLGA FAKSDPNDPA LARPDLEYHV QPLSLERFGE PLHRFNAFTA SVCHLRPTSR GSVHAASPDP ARAPSIAPNY LSTDYDRHVA ANALRLTRRI ASAPALARYA PEEILPGARY VSEAELIAAA GAVGTTIFHP VGTCRMGRAD DPDAVVDSRL RVRGVTGLRV VDASVMPTIT SGNTNSPTLM IAERASDMIR ADRRGASERG ASARAEAVLP T
|
| |