Gene EcolC_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1044 
Symbol 
ID6066436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1130155 
End bp1131435 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content59% 
IMG OID641600457 
Product4-aminobutyrate aminotransferase 
Protein accessionYP_001724040 
Protein GI170019086 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID[TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA ATAAAGAGTT AATGCAGCGC CGCAGTCAGG CGATTCCCCG TGGTGTTGGG 
CAAATTCACC CGATTTTCGC TGACCGCGCG GAAAACTGCC GGGTGTGGGA CGTTGAAGGC
CGTGAGTATC TTGATTTCGC GGGCGGGATT GCGGTGCTCA ATACCGGGCA CTTGCATCCG
AAAGTGGTGG CTGCGGTGGA AGCGCAGTTG AAAAAACTGT CGCACACCTG CTTCCAGGTG
CTGGCCTACG AGCCGTATCT GGAGCTGTGC GAGATTATGA ATCAGAAGGT GCCAGGCAAT
TTTGCCAAGA AAACGCTGCT GGTCACCACA GGTTCTGAAG CGGTGGAAAA CGCGGTGAAA
ATCGCCCGCG CCGCCACCAA ACGTAGCGGC ACCATCGCTT TTAGCGGCGC GTATCACGGG
CGCACGCATT ACACGCTGGC GCTGACCGGC AAGGTGAATC CGTACTCTGC GGGCATGGGG
CTGATGCCGG GTCATGTTTA TCGCGCGCTT TATCCTTGCC CGCTGCACGG CATAAGCGAG
GATGACGCTA TCGCCAGCAT CCACCGGATC TTCAAAAATG ATGCCGCGCC GGAAGATATC
GCCGCCATCG TGATTGAGCC GGTTCAGGGC GAAGGCGGTT TCTACGCCTC GTCGCCAGCC
TTTATGCAGC GTTTACGCGC TCTGTGTGAC GAGCACGGGA TCATGCTGAT TGCCGATGAA
GTGCAGAGCG GCGCGGGGCG TACCGGCACG CTGTTTGCGA TGGAGCAGAT GGGCGTTGCG
CCGGATCTTA CCACCTTTGC GAAATCGATC GCGGGCGGCT TCCCGCTGGC GGGCGTCACC
GGGCGCGCGG AAGTAATGGA TGCCGTCGCT CCAGGCGGTC TGGGCGGCAC CTATGCGGGT
AACCCGATTG CCTGCGTGGC TGCGCTGGAA GTGTTGAAGG TGTTTGAGCA GGAAAATCTG
CTGCAAAAAG CCAACGATCT GGGGCAGAAG TTGAAAGACG GATTGCTGGC GATAGCCGAA
AAACACCCGG AGATCGGCGA CGTACGCGGG CTGGGGGCGA TGATCGCCAT TGAGCTGTTT
GAAGACGGCG ATCACAACAA GCCGGACGCC AAACTCACCG CCGAGATCGT GGCTCGCGCC
CGCGATAAAG GCCTGATTCT TCTCTCCTGC GGCCCGTATT ACAACGTGCT GCGCATCCTT
GTACCGCTCA CCATTGAAGA CGCTCAGATC CGTCAGGGTC TGGAGATCAT CAGCCAGTGT
TTTGATGAGG CGAAGCAGTA G
 
Protein sequence
MSSNKELMQR RSQAIPRGVG QIHPIFADRA ENCRVWDVEG REYLDFAGGI AVLNTGHLHP 
KVVAAVEAQL KKLSHTCFQV LAYEPYLELC EIMNQKVPGN FAKKTLLVTT GSEAVENAVK
IARAATKRSG TIAFSGAYHG RTHYTLALTG KVNPYSAGMG LMPGHVYRAL YPCPLHGISE
DDAIASIHRI FKNDAAPEDI AAIVIEPVQG EGGFYASSPA FMQRLRALCD EHGIMLIADE
VQSGAGRTGT LFAMEQMGVA PDLTTFAKSI AGGFPLAGVT GRAEVMDAVA PGGLGGTYAG
NPIACVAALE VLKVFEQENL LQKANDLGQK LKDGLLAIAE KHPEIGDVRG LGAMIAIELF
EDGDHNKPDA KLTAEIVARA RDKGLILLSC GPYYNVLRIL VPLTIEDAQI RQGLEIISQC
FDEAKQ