Gene Information |
Locus tag | EcSMS35_2531 |
Symbol | |
ID | 6143181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2587863 |
End bp | 2589101 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617403 |
Product | aminotransferase |
Protein accession | YP_001744574 |
Protein GI | 170683338 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
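The coordinate and length fields above are arithmetically related, and the following minimal sketch checks that relationship. It assumes that Start bp and End bp are both inclusive, 1-based positions and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2587863
END_BP = 2589101


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1239 412"
```

Under these assumptions the result agrees with the Gene Length (1239 bp) and Protein Length (412 aa) fields of the record.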
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACA CTCGCCCTGA ACGTCGCTTT ACGCGTATTG ATCGTCTCCC GCCCTACGTT TTTAATATTA CCGCTGAACT GAAAATGGCT GCGCGTCGGC GCGGCGAAGA TATTATCGAT TTCAGTATGG GTAACCCGGA CGGTGCGACT CCGCCGCATA TCGTCGAAAA ATTATGTACG GTGGCCCAGC GCCCGGACAC GCATGGTTAC TCCACGTCGC GCGGCATTCC GCGGTTACGT CGCGCCATTT CCCGCTGGTA TCAGGATCGC TACGGCGTTG AGATCGACCC GGAATCAGAA GCTATCGTCA CCATTGGTTC GAAAGAGGGA CTGGCGCATC TGATGCTGGC GACGCTGGAT CATGGTGACA CTGTACTGGT GCCTAATCCA AGCTACCCGA TTCATATTTA CGGCGCGGTG ATTGCCGGGG CGCAGGTACG CTCAGTGCCG CTGGTGGAAG GTGTCGATTT CTTCAACGAA CTGGAACGCG CCATTCGCGA AAGTTATCCG AAACCGAAGA TGATGATCCT CGGCTTCCCG TCGAACCCAA CCGCGCAATG CGTGGAGCTG GAATTCTTTG AGAAGGTTGT GGCGCTGGCG AAACGCTACG ATGTGCTGGT GGTCCATGAC CTGGCCTATG CCGATATCGT CTACGATGGC TGGAAAGCGC CGTCAATCAT GCAGGTGCCA GGTGCACGCG ATGTGGCGGT CGAGTTCTTT ACGCTGTCGA AAAGTTACAA CATGGCGGGC TGGCGTATCG GCTTTATGGT CGGTAACAAA ACGCTGGTCA GCGCGTTGGC ACGTATTAAA AGCTATCACG ATTACGGCAC CTTTACGCCG TTGCAGGTGG CAGCGATTGC GGCGCTGGAG GGCGATCAAC AGTGCGTGCG CGACATTGCT GAACAGTACA AACGCCGCCG TGATGTACTG GTAAAAGGGC TGCATGAAGC GGGCTGGATG GTCGAAATGC CGAAGGCTTC GATGTATGTC TGGGCGAAAA TCCCGGAACC ATATGCGGCC ATGGGTTCAC TGGAATTTGC CAAGAAGCTG CTTAACGAAG CGAAGGTCTG TGTCTCTCCA GGGATTGGCT TTGGCGACTA CGGCGACACC CATGTTCGCT TTGCACTGAT TGAAAACCGC GATCGTATTC GCCAGGCGAT TCGTGGAATT AAAGCGATGT TCCGTGCCGA CGGTTTACTA CCCGCCAGCA GCAAACATAT TCACGAAAAC GCGGAATAA
|
Protein sequence | MADTRPERRF TRIDRLPPYV FNITAELKMA ARRRGEDIID FSMGNPDGAT PPHIVEKLCT VAQRPDTHGY STSRGIPRLR RAISRWYQDR YGVEIDPESE AIVTIGSKEG LAHLMLATLD HGDTVLVPNP SYPIHIYGAV IAGAQVRSVP LVEGVDFFNE LERAIRESYP KPKMMILGFP SNPTAQCVEL EFFEKVVALA KRYDVLVVHD LAYADIVYDG WKAPSIMQVP GARDVAVEFF TLSKSYNMAG WRIGFMVGNK TLVSALARIK SYHDYGTFTP LQVAAIAALE GDQQCVRDIA EQYKRRRDVL VKGLHEAGWM VEMPKASMYV WAKIPEPYAA MGSLEFAKKL LNEAKVCVSP GIGFGDYGDT HVRFALIENR DRIRQAIRGI KAMFRADGLL PASSKHIHEN AE
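The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this on the first three 10-nt blocks and the final block of the gene sequence, not on the full CDS. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons); the helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in the conventional T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last block of the gene sequence in the record.
print(translate("ATGGCTGACA CTCGCCCTGA ACGTCGCTTT"))  # prints "MADTRPERRF"
print(translate("GCGGAATAA"))  # prints "AE"
```

Both fragments match the corresponding ends of the protein sequence in the record: it begins with MADTRPERRF and ends with AE, with TAA as the stop codon.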