Gene EcSMS35_2192 details

Gene Information

Locus tag: EcSMS35_2192
Symbol: aspC
ID: 6146560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2201720
End bp: 2202910
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 52%
IMG OID: 641617068
Product: aromatic amino acid aminotransferase
Protein accession: YP_001744242
Protein GI: 170682145
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1448] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
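
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic, which can be sketched as a consistency check. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2201720, 2202910)
print(length)                     # 1191
print(protein_length_aa(length))  # 396
```

Under those assumptions both values agree with the Gene Length and Protein Length fields of this record.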


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.330977
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGAGA ACATTACCGC CGCTCCTGCC GACCCGATTC TGGGCCTGGC CGATCTGTTT 
CGTGCCGATG AACGTCCCGG CAAAATTAAC CTCGGGATTG GTGTCTATAA AGATGAGACG
GGCAAAACCC CGGTACTGAC CAGCGTGAAA AAGGCTGAAC AGTATTTGCT CGAAAATGAA
ACCACCAAAA ATTACCTCGG CATTGACGGC ATCCCTGAAT TTGGTCGCTG CACTCAGGAA
CTGCTGTTTG GTAAAGGTAG CGCCCTGATC AATGACAAAC GTGCTCGCAC GGCACAGACT
CCGGGTGGCA CTGGCGCACT ACGCGTGGCT GCCGATTTCC TGGCAAAAAA TACCAGCGTT
AAGCGTGTGT GGGTGAGCAA CCCAAGCTGG CCGAACCATA AGAGCGTCTT TAACTCTGCA
GGTCTGGAAG TTCGTGAATA CGCTTATTAT GATGCGGAAA ATCACACCCT TGACTTCGAT
GCACTGATTA ACAGCCTGAA CGAAGCTCAG GCTGGCGACG TAGTGCTGTT CCATGGCTGC
TGCCATAACC CAACCGGTAT CGACCCTACG CTGGAACAAT GGCAGACACT GGCACAACTT
TCCGTTGAGA AAGGCTGGTT ACCGCTGTTT GACTTCGCTT ACCAGGGTTT TGCCCGTGGT
CTGGAAGAAG ATGCTGAAGG ACTGCGCGCT TTCGCTGCTA TGCATAAAGA GCTGATTGTT
GCCAGTTCCT ACTCTAAAAA CTTTGGCCTG TACAACGAGC GTGTTGGCGC TTGTACTCTG
GTTGCTGCTG ACAGTGAGAC CGTTGATCGC GCATTCAGCC AAATGAAAGC GGCGATTCGC
GCTAACTACT CTAACCCACC AGCACACGGC GCTTCTGTTG TTGCCACCAT CCTGAGCAAC
GATGCGTTAC GTGCGATTTG GGAACAAGAG CTGACTGATA TGCGCCAGCG TATTCAGCGT
ATGCGTCAGT TGTTCGTCAA TACGCTGCAG GAAAAAGGGG CAAACCGCGA CTTCAGCTTT
ATCATCAAAC AGAACGGTAT GTTCTCCTTC AGTGGCCTGA CGAAAGAACA GGTACTGCGT
CTGCGTGAAG AGTTTGGCGT GTATGCAGTG GCTTCTGGTC GTGTGAACGT GGCCGGGATG
ACGCCAGATA ACATGGCTCC GCTGTGCGAA GCGATTGTGG CCGTGCTGTA A
 
Protein sequence
MFENITAAPA DPILGLADLF RADERPGKIN LGIGVYKDET GKTPVLTSVK KAEQYLLENE 
TTKNYLGIDG IPEFGRCTQE LLFGKGSALI NDKRARTAQT PGGTGALRVA ADFLAKNTSV
KRVWVSNPSW PNHKSVFNSA GLEVREYAYY DAENHTLDFD ALINSLNEAQ AGDVVLFHGC
CHNPTGIDPT LEQWQTLAQL SVEKGWLPLF DFAYQGFARG LEEDAEGLRA FAAMHKELIV
ASSYSKNFGL YNERVGACTL VAADSETVDR AFSQMKAAIR ANYSNPPAHG ASVVATILSN
DALRAIWEQE LTDMRQRIQR MRQLFVNTLQ EKGANRDFSF IIKQNGMFSF SGLTKEQVLR
LREEFGVYAV ASGRVNVAGM TPDNMAPLCE AIVAVL
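
The gene and protein sequences above can be checked against each other, and the GC content recomputed, with a few lines of code. The following is a minimal sketch, not the method used to generate this record. It builds the codon table by hand (translation table 11 uses the same amino-acid assignments as the standard code and differs in the start codons it permits), and the helper names `clean`, `gc_percent` and `translate` are hypothetical. Any character other than A, C, G or T would raise a `KeyError` in `translate`.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGTTTGAGA ACATTACCGC CGCTCCTGCC GACCCGATTC TGGGCCTGGC CGATCTGTTT"
print(translate(first_line))  # MFENITAAPADPILGLADLF
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence, or passing it to `gc_percent`, applies the same check to the whole record.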