Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2080 |
Symbol | |
ID | 4882697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2067908 |
End bp | 2069008 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640128008 |
Product | saccharopine dehydrogenase |
Protein accession | YP_001059115 |
Protein GI | 126439892 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1748] Saccharopine dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.599315 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCG CCATCGTCGG CGCGGGTCTC ATCGGCCATA CGATCGCCCA TCTGTTGCGC GAAACCGGCG ACTACGAAGT AGTCGCATTC GATCGCGATG CGGACGCGCT CGCGAAGCTG GCAAACGAGG GCATCGCGAC GCAGCGCGTC GATTCCGCGG ACGCGGCCGC GATCCGCGAA GCGGTGAAGG GCTTCGATGC GCTCGTCAAT GCGCTGCCGT ATTACCTGGC CGTCAACGTC GCCGCCGCCG CGAAGGCCGC GGGCGTGCAT TACTTCGATC TGACCGAGGA CGTGCGCGCG ACGAGCGCGA TCCGCGAGCT CGCCGAAGGC TCGAATCGCG CGTTCATGCC GCAGTGCGGC CTCGCGCCGG GCTTCATCGG CATCGCCGCG CACGAGCTCG TGAACGGCTT CACCGAAGTG CGCGACGTGA AGATGCGCGT CGGCGCGCTA CCCGAGTATC CGACCAACGC GCTGAAGTAC AACCTGACGT GGAGCGTCGA CGGTCTCATC AACGAATACT GCCAGCCGTG CGAAGCGGTG CGCGACGGCC GCCGCCAATG GGTGCAGCCG CTCGAAGGGC TCGAGCACTT CTCGCTCGAC GGCATCGAAT ACGAGGCGTT CAACACGTCG GGCGGCCTCG GCACGCTGTG CGAGACGCTC GAGGGCAAGG TCGAGACGCT CGATTACAAG TCGGTCCGCT ACCCGGGCCA CCGCGAGCTG ATCCAGTTCC TGCTCGAGGA TCTGCGCCTC GCGACCGATC GCGATACGCT GAAGTCGATC ATGCGCCGCG CGGTGCCGTC GACGAAGCAG GACGTCGTGC TCGTGTTCGT CACGGTGACG GGCGTGAAGC ACGGCCAGCT CGTGCAGGAC GTGTTCACGC GTAAGATCTT CGCGAAGGAG ATCTGCGGGA TGCCGATGAG CGCGATCCAG ATCACGACGG CGGGCGCGAT GTGCGCGGTG CTCGATCTGT TCCGCGAAAA GAAGTTGCCG CAAAGCGGCT TCGTGCGCCA GGAGCAGGTG CCGCTGCATG CGTTCCTCGC GAACCGCTTC GGCAAGCTGT ACGAGGGCGG CACGCTCGAG CGGATGCACG CGCTCGCATG A
|
Protein sequence | MKIAIVGAGL IGHTIAHLLR ETGDYEVVAF DRDADALAKL ANEGIATQRV DSADAAAIRE AVKGFDALVN ALPYYLAVNV AAAAKAAGVH YFDLTEDVRA TSAIRELAEG SNRAFMPQCG LAPGFIGIAA HELVNGFTEV RDVKMRVGAL PEYPTNALKY NLTWSVDGLI NEYCQPCEAV RDGRRQWVQP LEGLEHFSLD GIEYEAFNTS GGLGTLCETL EGKVETLDYK SVRYPGHREL IQFLLEDLRL ATDRDTLKSI MRRAVPSTKQ DVVLVFVTVT GVKHGQLVQD VFTRKIFAKE ICGMPMSAIQ ITTAGAMCAV LDLFREKKLP QSGFVRQEQV PLHAFLANRF GKLYEGGTLE RMHALA
|
| |