Gene Information |
Locus tag | Tmz1t_1209 |
Symbol | |
ID | 7083869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1338856 |
End bp | 1339776 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643698225 |
Product | branched-chain amino acid aminotransferase |
Protein accession | YP_002354864 |
Protein GI | 217969630 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | [TIGR01122] branched-chain amino acid aminotransferase, group I |
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCATGG CTGACCGCGA CGGCTTCATC TGGCAGGACG GCAAGCTCGT TCCGTGGCGC GAGGCGACCA CCCACGTCCT CACCCACTCG CTCCACTACG GCCTCGCCGT GTTCGAGGGC GTCCGTGCCT ACAACACCGT CAGCGGCACC GCGATCTTCC GCCTCGCCGA GCACACCCAG CGCCTGATCA ATTCCGGCAA GATCTACATG ATGGACATCC CCTATTCCAA GGATGAACTC ATGGAGGCGC AGAAGGAGGT CGTGCGCGCG AACAAGCTCG AGTCCTGCTA CCTGCGCCCG ATCGCCTTCT ACGGCTCGGA AAAAATGGGC ATCTCCACGC TCGGCGCGCG CGTCCATGTC GCGATCGCGG CCTGGCCCTG GGGCGCCTAC CTCGGCGAGG AAGGCCTGCA GAAGGGCATC CGCGTGAAGA CCTCGTCCTA CACCCGCCAC CACGTCAACT CGACGATGCC GCGCGCCAAG CTGTCGGCGA CCTACCCGAA CTCGATCCTC GCCAACCTCG AGGTCACCCG CATGGGCTAC GACGAGGCCC TGCTGCTCGA CAACCAGGGC TTCGTCGCCG AGGGTGCAGG CGAGAACCTC TTCATCGTCA AGGACGGCCG CATCTACGAG CCCGAGATCG CCTCCGCGCT CACCGGCATC ACCCGCGACT CCATCCACGT GATCGCGCGC GAGCTCGGCT ACGAGGTCGG CACCAAGCGC CTGACCCGCG ACGACATCTA CCTCGCCGAC GAGGCCTTCT TCACCGGCAC CGCCGCCGAG GTCACCCCGA TCCGCGAGCT CGACGACCGC CAGATCGGCG AAGGCAGGCG CGGCCCGGTC ACCGAGAAGA TCCAGACCCG CTTCTTCGAC GTCGTCAACG GCCGTGCGCC CGAATACGCG CACTGGCTCG CCCACGTCTG A
|
Protein sequence | MSMADRDGFI WQDGKLVPWR EATTHVLTHS LHYGLAVFEG VRAYNTVSGT AIFRLAEHTQ RLINSGKIYM MDIPYSKDEL MEAQKEVVRA NKLESCYLRP IAFYGSEKMG ISTLGARVHV AIAAWPWGAY LGEEGLQKGI RVKTSSYTRH HVNSTMPRAK LSATYPNSIL ANLEVTRMGY DEALLLDNQG FVAEGAGENL FIVKDGRIYE PEIASALTGI TRDSIHVIAR ELGYEVGTKR LTRDDIYLAD EAFFTGTAAE VTPIRELDDR QIGEGRRGPV TEKIQTRFFD VVNGRAPEYA HWLAHV
|
| |
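The length fields in this record follow from the coordinates and the sequence: 1339776 - 1338856 + 1 = 921 bp, and 921 / 3 = 307 codons, one of which is the stop codon, leaving 306 aa. The Python sketch below checks that arithmetic and translates a stretch of the gene sequence. The helper names (`gene_length`, `gc_fraction`, `translate`) are hypothetical and not part of the source database. The sketch assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand), and it relies on translation table 11 sharing the standard codon-to-amino-acid assignments, differing only in which start codons are permitted.

```python
# Illustrative helpers; names are hypothetical, not taken from the source record.

BASES = "TCAG"
# Codon assignments in NCBI "TCAG" order. Translation table 11 uses the same
# assignments as the standard code; only the allowed start codons differ.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in tens and upper-case the result."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence codon by codon ('*' marks a stop)."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    length = gene_length(1338856, 1339776)
    print(length)            # 921
    print(length // 3 - 1)   # 306 (codons minus the stop codon)
    # First 30 bases of the gene sequence from the record
    print(translate("ATGTCCATGG CTGACCGCGA CGGCTTCATC"))  # MSMADRDGFI
```

Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content and protein sequence fields above.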