Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0585 |
Symbol | |
ID | 8135900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 719736 |
End bp | 720806 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868202 |
Product | branched-chain amino acid aminotransferase |
Protein accession | YP_003020417 |
Protein GI | 253699228 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | [TIGR01123] branched-chain amino acid aminotransferase, group II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.00279686 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGATCA GGATTGTGCC CCTCAGCGAG GGAGAGAAGA AGCCGAAATT CACCGACGAA TCACAGCTTG GCTTTGGCAA GATTTTCACC GACCGCATGC TTTTGGTCGA GTGGAAAGTC GGCCAGGGGT GGGTAGATGC CAGGATCAAG AAGTACGAAC CGTTCCTGCT CGATCCGGCG GCGCTCGTGC TGCATTATGC CCAGGAAATC TTCGAAGGGC TCAAGGCCTA CAAGTGGAAG GACGGCAGCA TCGCGCTGTT CCGCCCCGAG ATGAACGCCC GCCGCTTCAA CCACTCCGCT GACCGTCTCT GCATGCCCGA GATCCCGGAG GAGCTCTTTG TGAGCGGCAT CGAGCAGTTG GTCTCCGCCG AGCGCGACTG GGTTCCCGGC GCCGAAGGGA CTTCGCTCTA CATCCGCCCC ACCATGATCG CGGTGGAGCC GCTGGTCGGC ATCAAGCCTT CGGACCACTA CTACTTCTAC GTGATACTCT CCCCGGTCGG CGCCTATTAC GCCAACGGCT TCAATCCGGT GAAGATCATG GTCGAGGACC ACTACGTCAG GGCCACCCCC GGCGGCACCG GCGAGGCCAA GACCGGCGGC AACTACGCCA GCTCCCTCAA GGCGGGGCTC GAGGCCAAGA AGAAAGGTTT CGACCAGGTG CTCTGGCTGG ACGGCGTGCA CAAGCGCTAC ATCGAGGAAG TAGGCTCGAT GAACATGTTC TTCGCCTACG GCGACACCAT CGTCACCGCG CCGCTTGAAG GAAGCATCCT GAACGGCATC ACCCGCGACT CCGTGCTGAC CCTGGCCAAG TCGTTGAACC TGAAGGTCGA GGAGCGGCGC ATCGACGTGA AGGACCTGAT GGCGGATCTC AAAAGCGGGA AGATCACCGA GGCGTTCGGC AGCGGCACCG CCGCGGTCGT CACCCCTGTG GGTACGCTGA GCTATCTCGG CGAATCGGTA CAGGTAGGAA CCGGCGGCGT CGGCAAATAC ACCCAGGTGC TCTACGACAC GCTGACCGGA ATCCAGACCG GCAAGATCGA AGACAAATTC GGCTGGATCA GGAAGATCTA G
|
Protein sequence | MEIRIVPLSE GEKKPKFTDE SQLGFGKIFT DRMLLVEWKV GQGWVDARIK KYEPFLLDPA ALVLHYAQEI FEGLKAYKWK DGSIALFRPE MNARRFNHSA DRLCMPEIPE ELFVSGIEQL VSAERDWVPG AEGTSLYIRP TMIAVEPLVG IKPSDHYYFY VILSPVGAYY ANGFNPVKIM VEDHYVRATP GGTGEAKTGG NYASSLKAGL EAKKKGFDQV LWLDGVHKRY IEEVGSMNMF FAYGDTIVTA PLEGSILNGI TRDSVLTLAK SLNLKVEERR IDVKDLMADL KSGKITEAFG SGTAAVVTPV GTLSYLGESV QVGTGGVGKY TQVLYDTLTG IQTGKIEDKF GWIRKI
|
| |