Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_4000 |
Symbol | gcvP |
ID | 4900284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 3900507 |
End bp | 3903419 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640137225 |
Product | glycine dehydrogenase |
Protein accession | YP_001068218 |
Protein GI | 126451827 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTCG AACACCCGGA CCGCCTGATG AACCGCACGC CCCTCTCGCT CGCCGCGCTC GAAACGCACG ACGCGTTCGC CGAACGCCAT ATCGGCCCCG ACGCCGCCAG CCAGCAGGCC ATGCTCGACA CGCTCGGCTT CGCGACGCGC GCCGCACTGA TCGACGCCGT GATCCCCGCG TCGATCCGCC GCGCCGAAAC GCTGCCGCTC GGCCCGTTCG CGCAGCCGAA GAGCGAAGCC GAAGCGCTCG CCGCGCTGCG CGCGCTCGCG GACAAGAACC AGGTGTTCCG CTCGTACATC GGCCAAGGCT ACTACGACAC CCACACCCCG GCGGTGATCC TGCGCAACGT GCTCGAAAAC CCGGCGTGGT ACACCGCGTA CACGCCGTAC CAACCCGAAA TCTCGCAGGG CCGCCTCGAG GCGCTGCTGA ACTTCCAGCA GATGGTCGCC GACCTGACGG GCCTCGAGAT CTCCAACGCG TCGCTGCTCG ACGAAGCCAC GGCCGCAGCC GAAGCGATGA CGCTGCTGCA ACGCGTCGGC AAGCCGCAGT CGAACGTGTT CTACGTCGCC GACGACGTGC TGCCGCAAAC GCTCGAAGTA ATCAAGACGC GCGCGAAGCC GATCGGCATC GAAGTGAAGT CGGGCCCGGC CGCCGACGCC GCCGCCGCGA ACGCGTTCGG CGTGCTGCTG CAATATCCGG GCGCGAACGG CGACGTGCGC GACTACCGCG CGCTCGCCGA CGCGATCCAC GCCGCGGGCG GCCACGTCGT CGTCGCGGCC GACATCCTCG CGCTCACCGT GCTCATGCCG CCCGGCGAAT GGGGCGCGGA CGTCGCCGTC GGCAACACGC AGCGCTTCGG CGTGCCGATG GGCTTCGGCG GCCCGCACGC CGCATACATG GCGGTGCGCG ACGAATTCAA GCGGCAGATG CCGGGCCGCC TCGTCGGCGT GACCGTCGAC GCGCAGGGCA AGCCCGCGCT GCGCCTCGCG CTGCAAACGC GCGAGCAACA CATCCGCCGC GAGAAGGCAA CGTCGAACGT CTGCACCGCG CAGGCGCTGC TCGCGATCAT GGCGAGCATG TACGCGGTCT ACCACGGCCC GCGCGGCCTG AAGACGATCG CGCTGCGCGT GAACCGCATC GCGGCGCTCC TCGCCGCGGG CATCAGGCAT CTCGGCTACG CAACCGTCAA CGACACGTTC TTCGACACGC TGACGATCGA CACCGGCGCG CGCACCGCGC AACTCCATGC GTTCGCGCAA GCGAAGCGCA TCAACCTGCG CCGCGCGGGC GACACGCGAG TCGGCGTGTC GGTCGACGAA ACGACGACGC GCGCCGATCT CGCCGATCTG CTCACGATCT TCGCGCAGGC CGCGGGCGCG ACGGCGCCCG ACATCGACGC GCTCGACGCC GGGCTCCTCC CCGCGCCCGC GCTGCCGCCG AGCCTCGAGC GCACGAGCGC GTACCTGACG CACCACGTGT TCAACCGCCA CCATTCGGAA ACGGAAATGC TGCGCTACCT GCGCAGCCTG TCGGACAAGG ATCTCGCGCT CGACCGCTCG ATGATCCCGC TCGGCTCGTG CACGATGAAG CTGAACGCGA CCTCCGAAAT GCTGCCCGTC ACGTGGCCCG AATTCGGCCG GATCCACCCG TTCGCGCCCG CCGAGCAGAC CGTCGGCTAT CGCGAGATGA TCGACCAGCT CGAGCAGATG CTCGTCGCGG CAACGGGCTA CGCGGCCGTG TCGCTGCAGC CGAACGCCGG CTCGCAGGGC GAGTACGCGG GCCTGCTCAT CATCCATGCG TATCACGAAT CGCGCGGCGA AAGCCACCGC GATGTCTGCC TGATTCCGGC GTCCGCGCAC GGCACGAACC CGGCGTCGGC GCACATGGCC GGCATGAAGG TCGTGGTGGT CGCGTGCGAC GCGCAAGGCA ACGTCGACAT CGCCGACCTG AAGGCGAAGG CCGACGCGCA TTCGCACGAC CTCGCGGCGA TCATGATCAC GTATCCGTCG ACGCACGGCG TGTTCGAGCA GAACGTGCGC GAGATCTGCG AGATCGTCCA CGCGCACGGC GGCCAGGTGT ACGTCGACGG CGCGAACATG AACGCGATGG TCGGCCTCAC CGCGCCCGGC CAGTTCGGCG GCGACGTGTC GCACTTGAAC CTGCACAAGA CCTTCTGCAT CCCGCACGGC GGCGGCGGCC CGGGCGTCGG CCCGCACCTC GCGAAATTCC TGCCGAACCA GCGCTCGACG GGCTACGCGC GCGGCGAAGA CGGCATCGGC GCGGTGTCGG CGGCGCCTTA CGGCTCGGCG TCGATCCTGC CGATCTCGTG GATGTACATC GCGATGATGG GCGCGAAGAA TTTGACCGCG GCGACCGAAA CCGCGATCCT CAACGCGAAC TACATCGCGA AGCGCCTCGC GCCGCACTAT CCGGTGCTGT ATTCGGGCCC GGGCGGGCTC GTCGCGCACG AATGCATTCT CGATCTGCGC CCGATCAAGG ATTCGAGCGG CATCACCGTC GACGACGTCG CCAAGCGCCT GATGGACTAC GGCTTTCACG CACCGACGAT GAGCTTCCCG GTGCCGGGCA CGCTGATGGT CGAGCCGACC GAATCGGAAT CGCAGGAGGA ACTGGACCGC TTCATCGCGG CGATGATCGC GATCCGCGAC GAAATCCGCG CAGTCGAGGA AGGCCGCGCC GACCGCGAGG ACAACCCGCT GCGTCACGCG CCGCACACGG CAGCCGTCGT CACCGCGAAC GAATGGCCGC ACGCGTACTC GCGCGAACAG GCCGCGTTCC CGGTCGCGTC GCTCGTCGCG AACAAGTACT GGCCGCCCGT CGGCCGCGCG GACAACGCAT ATGGCGACCG CAATCTGTTC TGCTCCTGCG TGCCGGTATC GGATTACGCC TGA
|
Protein sequence | MKLEHPDRLM NRTPLSLAAL ETHDAFAERH IGPDAASQQA MLDTLGFATR AALIDAVIPA SIRRAETLPL GPFAQPKSEA EALAALRALA DKNQVFRSYI GQGYYDTHTP AVILRNVLEN PAWYTAYTPY QPEISQGRLE ALLNFQQMVA DLTGLEISNA SLLDEATAAA EAMTLLQRVG KPQSNVFYVA DDVLPQTLEV IKTRAKPIGI EVKSGPAADA AAANAFGVLL QYPGANGDVR DYRALADAIH AAGGHVVVAA DILALTVLMP PGEWGADVAV GNTQRFGVPM GFGGPHAAYM AVRDEFKRQM PGRLVGVTVD AQGKPALRLA LQTREQHIRR EKATSNVCTA QALLAIMASM YAVYHGPRGL KTIALRVNRI AALLAAGIRH LGYATVNDTF FDTLTIDTGA RTAQLHAFAQ AKRINLRRAG DTRVGVSVDE TTTRADLADL LTIFAQAAGA TAPDIDALDA GLLPAPALPP SLERTSAYLT HHVFNRHHSE TEMLRYLRSL SDKDLALDRS MIPLGSCTMK LNATSEMLPV TWPEFGRIHP FAPAEQTVGY REMIDQLEQM LVAATGYAAV SLQPNAGSQG EYAGLLIIHA YHESRGESHR DVCLIPASAH GTNPASAHMA GMKVVVVACD AQGNVDIADL KAKADAHSHD LAAIMITYPS THGVFEQNVR EICEIVHAHG GQVYVDGANM NAMVGLTAPG QFGGDVSHLN LHKTFCIPHG GGGPGVGPHL AKFLPNQRST GYARGEDGIG AVSAAPYGSA SILPISWMYI AMMGAKNLTA ATETAILNAN YIAKRLAPHY PVLYSGPGGL VAHECILDLR PIKDSSGITV DDVAKRLMDY GFHAPTMSFP VPGTLMVEPT ESESQEELDR FIAAMIAIRD EIRAVEEGRA DREDNPLRHA PHTAAVVTAN EWPHAYSREQ AAFPVASLVA NKYWPPVGRA DNAYGDRNLF CSCVPVSDYA
|
| |