Gene EcSMS35_1820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1820 
SymbolgabT1 
ID6143459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1839045 
End bp1840310 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content58% 
IMG OID641616696 
Product4-aminobutyrate transaminase 
Protein accessionYP_001743874 
Protein GI170682308 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID[TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.144249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACA ATGAATTCCA TCAGCGTCGT CTTTCTGCTA CCCCTCGCGG GGTTGGCGTG 
ATGTGTAACT TCTTCGCCCA GTCGGCAGAA AACGCCATGC TGAAGGACGT AGAGGGCAAC
GAATACATCG ATTTCGCCGC AGGCATTGCG GTGCTGAATA CCGGGCATCG CCACCCTGAT
CTGGTCGCGG CGGTGGAGCA GCAATTGCAA CAGTTTACCC ACACGGCGTA TCAGATTGTG
CCGTACGAAA GCTACGTCAC CCTGGCGGAG AAAATCAACG CCCTTGCCCC GGTGAGCGGG
CAGGCTAAAA CTGCGTTCTT CACCACCGGT GCGGAAGCGG TGGAGAACGC GGTGAAAATC
GCCCGCGCCC ATACCGGACG CCCTGGCGTG ATTGCGTTTA GCGGCGGCTT CCACGGTCGT
ACATATATGA CCATGGCGCT GACCGGAAAG GTCGCGCCGT ACAAAATCGG CTTCGGCCCG
TTCCCCGGTT CGGTATATCA CGTACCTTAT CCGTCAGATT TACATGGCGT TTCAACGCAG
GACTCTCTCG ACGCCATCGA ACGCTTGTTT AAATCTGACA TTGAAGCGAA GCAGGTGGCG
GCGATTATTT TCGAACCGGT GCAGGGCGAA GGCGGTTTCA ACGTTGCACC AAAAGAGCTG
GTTGCCGCCA TTCGCCGCCT GTGCGACGAG CACGGCATTG TGATGATTGC CGATGAAGTG
CAAAGCGGCT TTGCGCGTAC CGGTAAACTG TTTGCCATGG ATCACTACGC CGATAAGCCG
GACTTAATGA CGATGGCGAA AAGCCTCGCG GGCGGCATGC CGCTTTCGGG CGTGGTCGGT
AACGCGAATA TTATGGACGC GCCCGCGCCG GGCGGGTTGG GTGGTACTTA CGCCGGGAAC
CCGCTGGCGG TGGCTGCCGC GCACGCTGTG CTCAACATTA TCGACAAAGA ATCACTCTGT
GAACGCGCGA ATCAACTGGG CCAGCGCCTG ACAAACACGT TGATTGATGC CAAAGAAAGC
GTTCCGGCCA TCGCGGCGGT ACGCGGTCTG GGGTCTATGA TTGCGGCAGA GTTTAACGAT
CCGCAAACGG GCGAGCCGTC AGCGGCGATT GCACAGAAAA TCCAGCAACG CGCGCTGGCG
CAGGGGCTGC TTCTGCTGAC CTGTGGCGCA TACGGCAACG TGATTCGTTT CCTGTATCCG
CTGACCATCC CGGATGCGCA ATTCGATGCG GCAATGAAAA TTTTGCAGGA TGCGCTGAAA
GATTAA
 
Protein sequence
MSNNEFHQRR LSATPRGVGV MCNFFAQSAE NAMLKDVEGN EYIDFAAGIA VLNTGHRHPD 
LVAAVEQQLQ QFTHTAYQIV PYESYVTLAE KINALAPVSG QAKTAFFTTG AEAVENAVKI
ARAHTGRPGV IAFSGGFHGR TYMTMALTGK VAPYKIGFGP FPGSVYHVPY PSDLHGVSTQ
DSLDAIERLF KSDIEAKQVA AIIFEPVQGE GGFNVAPKEL VAAIRRLCDE HGIVMIADEV
QSGFARTGKL FAMDHYADKP DLMTMAKSLA GGMPLSGVVG NANIMDAPAP GGLGGTYAGN
PLAVAAAHAV LNIIDKESLC ERANQLGQRL TNTLIDAKES VPAIAAVRGL GSMIAAEFND
PQTGEPSAAI AQKIQQRALA QGLLLLTCGA YGNVIRFLYP LTIPDAQFDA AMKILQDALK
D