Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2782 |
Symbol | gabT2 |
ID | 6143779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2866351 |
End bp | 2867631 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641617651 |
Product | 4-aminobutyrate aminotransferase |
Protein accession | YP_001744811 |
Protein GI | 170683877 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases |
TIGRFAM ID | [TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.480055 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCA ATAAAGAGTT AATGCAGCGC CGCAGTCAGG CGATTCCCCG TGGCGTTGGG CAAATTCACC CGATTTTCGC TGACCGCGCG GAAAACTGCC GGGTGTGGGA CGTTGAAGGC CGTGAGTATC TTGATTTCGC GGGCGGGATT GCGGTGCTCA ATACCGGGCA CCTGCATCCG AAGGTGGTGG CCGCGGTGGA AGCGCAGTTG AAAAAACTGT CGCACACCTG CTTCCAGGTG CTGGCTTACG AGCCGTATCT GGAGCTGTGC GAGATTATGA ATCAGAAGGT GCCGGGCGAT TTCGCCAAGA AAACGCTGCT GGTTACGACC GGTTCCGAAG CGGTGGAAAA CGCGGTGAAA ATCGCCCGCG CCGCCACCAA ACGTAGCGGC ACCATCGCTT TTAGCGGCGC GTATCACGGG CGCACGCATT ACACGCTGGC GCTGACCGGC AAGGTGAATC CGTACTCTGC GGGCATGGGG CTGATGCCGG GTCATGTTTA TCGCGCGCTT TATCCTTGCC CGCTGCACGG CATAAGCGAG GATGACGCTA TCGCCAGCAT CCACCGGATC TTCAAAAATG ATGCCGCGCC GGAAGATATC GCCGCCATCG TGATTGAGCC CGTTCAGGGC GAAGGCGGTT TCTACGCCGC GACGCCAGCC TTTATGCAGC GTTTACGCGC GCTGTGTGAC GAGCACGGGA TCATGCTGAT TGCCGATGAA GTGCAGAGCG GCGCAGGGCG TACCGGCACG CTGTTTGCGA TGGAGCAAAT GGGCGTGGCA CCAGATCTCA CCACCTTTGC GAAATCGATC GCAGGCGGTT TCCCGCTGGC GGGCGTCACC GGGCGCGCGG AAGTGATGGA CGCCGTCGCA CCGGGCGGAC TGGGTGGCAC CTATGCCGGT AACCCGATTG CCTGCGTAGC GGCGCTGGAA GTGTTGAAGG TGTTCGAGCA GGAAAATCTG CTGCAAAAAG CCAACGTTCT GGGGCAGAAG CTGAAAGACG GATTGCTGGC GATCGCCGAA AAACACCCGG AGATCGGCGA CGTACGCGGG CTGGGGGCGA TGATCGCCAT CGAGCTGTTT GAAGATGGCG ATCCCAGCAA ACCGGACGCA AAACTCACCG CCGAGATCGT GGCACGCGCC CGCGATAAGG GTCTGATTCT TCTCTCCTGC GGCCCGTATT ACAACGTGCT GCGCATCCTT GTACCGCTCA CCATTGAAGA CGCTCAGATC CGTCAGGGTC TGGAGATCAT CAGCCAGTGT TTTGCTGAGG CGAAGCTGTA G
|
Protein sequence | MSSNKELMQR RSQAIPRGVG QIHPIFADRA ENCRVWDVEG REYLDFAGGI AVLNTGHLHP KVVAAVEAQL KKLSHTCFQV LAYEPYLELC EIMNQKVPGD FAKKTLLVTT GSEAVENAVK IARAATKRSG TIAFSGAYHG RTHYTLALTG KVNPYSAGMG LMPGHVYRAL YPCPLHGISE DDAIASIHRI FKNDAAPEDI AAIVIEPVQG EGGFYAATPA FMQRLRALCD EHGIMLIADE VQSGAGRTGT LFAMEQMGVA PDLTTFAKSI AGGFPLAGVT GRAEVMDAVA PGGLGGTYAG NPIACVAALE VLKVFEQENL LQKANVLGQK LKDGLLAIAE KHPEIGDVRG LGAMIAIELF EDGDPSKPDA KLTAEIVARA RDKGLILLSC GPYYNVLRIL VPLTIEDAQI RQGLEIISQC FAEAKL
|
| |