Gene ECH74115_1089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1089 
SymbolaspC 
ID6966649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1117346 
End bp1118536 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content52% 
IMG OID643385101 
Productaromatic amino acid aminotransferase 
Protein accessionYP_002269600 
Protein GI209397827 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1448] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00147404 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.202333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAGA ACATTACCGC CGCTCCTGCC GACCCGATTC TGGGCCTGGC CGATCTGTTT 
CGTGCCGATG AACGTCCCGG CAAAATTAAC CTCGGGATTG GTGTCTATAA AGATGAGACG
GGCAAAACCC CGGTACTGAC CAGCGTGAAA AAGGCTGAAC AGTATCTGCT CGAAAATGAA
ACCACCAAAA ATTACCTCGG CATTGACGGC ATCCCTGAAT TTGGTCGCTG CACTCAGGAA
CTGCTGTTTG GTAAAGGTAG CGCCCTGATC AATGACAAAC GTGCTCGCAC GGCACAGACT
CCGGGTGGCA CTGGCGCACT ACGCATAGCT GCCGATTTCC TGGCAAAAAA TACCAGCGTT
AAGCGAGTGT GGGTGAGCAA CCCAAGCTGG CCGAACCATA AGAGCGTCTT TAACTCTGCA
GATCTGGAAG TTCGTGAATA CGCTTATTAT GATGCGGAAA ACCACACCCT TGACTTCGAT
GCACTGATTA ACAGCCTGAA CGAAGCTCAG GCTGGCGACG TAGTGCTGTT CCATGGCTGC
TGCCACAACC CAACCGGTAT CGACCCTACG CTGGAACAAT GGCAGACACT GGCACAACTC
TCCGTTGAGA AAGGCTGGTT ACCGCTGTTT GACTTCGCTT ACCAGGGTTT TGCCCGTGGT
CTGGAAGAAG ATGCTGAAGG ACTGCGCGCT TTCGCGGCTA TGCATAAAGA GCTGATTGTT
GCCAGTTCCT ACTCTAAAAA CTTTGGCCTG TACAACGAGC GTGTTGGCGC TTGTACTCTG
GTTGCTGCCG ACAGTGAAAC CGTTGATCGC GCATTCAGCC AAATGAAAGC GGCGATTCGC
GCTAACTACT CTAACCCACC AGCACACGGC GCTTCTGTTG TTGCCACCAT CCTGAGCAAC
GATGCGTTAC GTGCGATTTG GGAACAAGAG CTGACTGATA TGCGCCAGCG TATTCAGCGT
ATGCGTCAGT TGTTCGTCAA TACGCTGCAG GAAAAAGGCG CAAACCGCGA CTTCAGCTTT
ATCATCAAAC AGAACGGCAT GTTCTCCTTC AGTGGCCTGA CAAAAGAACA AGTGCTGCGT
CTGCGCGAAG AGTTTGGCGT GTATGCTGTT GCTTCTGGTC GCGTAAACGT GGCCGGGATG
ACACCAGATA ACATGGCTCC GCTGTGCGAA GCGATTGTGG CAGTGCTGTA A
 
Protein sequence
MFENITAAPA DPILGLADLF RADERPGKIN LGIGVYKDET GKTPVLTSVK KAEQYLLENE 
TTKNYLGIDG IPEFGRCTQE LLFGKGSALI NDKRARTAQT PGGTGALRIA ADFLAKNTSV
KRVWVSNPSW PNHKSVFNSA DLEVREYAYY DAENHTLDFD ALINSLNEAQ AGDVVLFHGC
CHNPTGIDPT LEQWQTLAQL SVEKGWLPLF DFAYQGFARG LEEDAEGLRA FAAMHKELIV
ASSYSKNFGL YNERVGACTL VAADSETVDR AFSQMKAAIR ANYSNPPAHG ASVVATILSN
DALRAIWEQE LTDMRQRIQR MRQLFVNTLQ EKGANRDFSF IIKQNGMFSF SGLTKEQVLR
LREEFGVYAV ASGRVNVAGM TPDNMAPLCE AIVAVL