Gene ECH74115_3904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3904 
SymbolgabT2 
ID6970839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3619158 
End bp3620438 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content58% 
IMG OID643387678 
Product4-aminobutyrate aminotransferase 
Protein accessionYP_002272126 
Protein GI209398707 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID[TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.888156 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA ATAAAGAGTT AATGCAGCGC CGCAGTCAGG CAATTCCTCG TGGCGTTGGG 
CAAATTCACC CCATTTTCGC TGATCGCGCG GAAAACTGCC GGGTGTGGGA CGTTGAAGGC
CGTGAGTATC TTGATTTCGC GGGCGGCATT GCGGTGCTCA ATACCGGGCA CCTGCATCCG
AAAGTGGTTG CCGCGGTGGA AGCGCAGTTG AAAAAACTGT CGCACACCTG CTTCCAGGTG
CTGGCTTACG AGCCGTATCT GGAGCTGTGC GAGATTATGA ATCAGAAGGT GCCGGGCGAT
TTTGCCAAGA AAACGCTGCT GGTTACGACC GGTTCCGAAG CGGTGGAAAA CGCGGTGAAA
ATCGCCCGCG CTGCCACCAA ACGTAGCGGC ACCATCGCTT TTAGCGGCGC GTATCACGGG
CGCACGCATT ACACGCTGGC GCTGACCGGC AAGGTGAATC CGTACTCTGC GGGCATGGGC
CTGATGCCAG GGCACGTTTA TCGCGCGCTT TATCCTTGCC CACTGCACGG CATCAGTGAA
GATGATGCTA TCGCCAGTAT CCACCGAATT TTTAAAAATG ATGCTGCGCC GGAAGATATC
GCCGCCATCG TGATTGAGCC GGTTCAGGGC GAAGGCGGTT TCTACGCCGC GACGCCTGCG
TTTATGCAGC GTTTACGCGC GCTGTGTGAC GAGCACGGGA TCATGCTGAT TGCCGATGAA
GTGCAGAGCG GCGCGGGGCG TACCGGCACG CTGTTTGCGA TGGAGCAAAT GGGCGTGGCA
CCAGATCTCA CCACCTTTGC GAAATCGATC GCAGGCGGCT TCCCACTGGC GGGCGTCACC
GGGCGCGCCG AAGTGATGGA TGCCGTCGCT CCAGGCGGGC TGGGTGGCAC CTATGCCGGT
AATCCGATTG CCTGCGTGGC GGCGCTGGAA GTGTTGAAGG TGTTCGAGCA GGAAAATCTG
CTGCAGAAAG CCAACGATCT GGGGCAGAAG TTGAAAGATG GATTGTTGGC GATCGCCGAA
AAACACCCTG AGATCGGCGA CGTACGCGGG CTGGGGGCGA TGATCGCCAT CGAGCTGTTT
GAAGACGGCG ATCACAACAA GCCGGACGCC AAACTCACCG CCGAGATCGT GGCTCGCGCC
CGCGATAAAG GCCTGATTCT TCTCTCCTGC GGCCCGTATT ACAACGTGCT GCGCATCCTT
GTACCGCTCA CCATTGAAGA CGCTCAGATC CGTCAGGGTC TGGAGATCAT CAGCCAGTGT
TTTGCTGAGG CAAAGCAGTA G
 
Protein sequence
MSSNKELMQR RSQAIPRGVG QIHPIFADRA ENCRVWDVEG REYLDFAGGI AVLNTGHLHP 
KVVAAVEAQL KKLSHTCFQV LAYEPYLELC EIMNQKVPGD FAKKTLLVTT GSEAVENAVK
IARAATKRSG TIAFSGAYHG RTHYTLALTG KVNPYSAGMG LMPGHVYRAL YPCPLHGISE
DDAIASIHRI FKNDAAPEDI AAIVIEPVQG EGGFYAATPA FMQRLRALCD EHGIMLIADE
VQSGAGRTGT LFAMEQMGVA PDLTTFAKSI AGGFPLAGVT GRAEVMDAVA PGGLGGTYAG
NPIACVAALE VLKVFEQENL LQKANDLGQK LKDGLLAIAE KHPEIGDVRG LGAMIAIELF
EDGDHNKPDA KLTAEIVARA RDKGLILLSC GPYYNVLRIL VPLTIEDAQI RQGLEIISQC
FAEAKQ