Gene Information |
Locus tag | EcSMS35_4515 |
Symbol | tyrB |
ID | 6144858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4614681 |
End bp | 4615874 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619331 |
Product | aromatic amino acid aminotransferase |
Protein accession | YP_001746443 |
Protein GI | 170683808 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1448] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
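The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions agree with the listed values but are not stated in the record. The helper names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the final codon is the stop, not a residue


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces used for 10-base grouping."""
    seq = dna.replace(" ", "").upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


length = gene_length(4614681, 4615874)
print(length)                  # 1194
print(protein_length(length))  # 397
```

`gc_percent` can be applied to the gene sequence given in the Sequence section to compare against the listed GC content.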
| |
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTCAAA AAGTTGACGC CTACGCTGGC GACCCGATTC TTACGCTTAT GGAGCGTTTT AAAGAAGACC CTCGCAGCGA CAAAGTGAAT TTAAGTATCG GTCTGTACTA CAACGAAGAG GGAATTATTC CACAACTGAA AGCCGTGGCG GATGCAGAAG CGCGCCTGAA TGCGCAGCCT CATGGCGCTT CGCTTTATTT ACCGATGGAA GGGCTTAACA GCTATCGCCA TGCCATTGCG CCGCTGCTGT TTGGTGCCGA CCATCCGGTA CTGCAACAAC AGCGCGTAGC AACCATTCAA ACCCTTGGCG GCTCAGGGGC ATTGAAAGTG GGCGCAGATT TCCTGAAACG CTACTTCCCG GAATCAGGCG TCTGGGTCAG CGATCCTACA TGGGAAAACC ACGTAGCAAT ATTCGCCGGG GCTGGATTCG AAGTAAGTAC TTACCCCTGG TATGACGAAG CGACTAACGG CGTGCGCTTT AATGACCTGT TGGCGACGCT GAAAACATTA CCTGCCCGCA GTATTGTGTT GCTGCATCCA TGTTGCCACA ACCCAACGGG TGCCGATCTC ACTAATAACC AGTGGGATGC GGTGATTGAA ATTCTCAAAG CCCGCGAGCT TATCCCATTT CTTGATATTG CCTATCAAGG ATTTGGTGCC GGTATGGAAG AGGATGCCTA CGCCATTCGC GCCATTGCCA GCGCTGGATT ACCCGCTCTG GTGAGCAATT CGTTCTCGAA AATTTTCTCC CTTTACGGCG AGCGCGTCGG CGGACTTTCT GTTCTGTGTG AAGATGCCGA AGCTGCAGAC CGCGTACTGG GGCAATTGAA AGCAACAGTT CGCCGCAACT ACTCCAGCCC GCCGAATTTT GGTGCGCAGG TGGTGGCAGC GGTGCTGAAT GACGAGGCAT TGAAAGCCAG CTGGCTGGCG GAAGTAGAAG AGATGCGTAC TCGCATTTTG GCAATGCGTC AGGAACTGGT GAAGGTATTA AGCACAGAGA TGCCAGAACG CAATTTCGAT TATCTGCTTA ATCAGCGCGG CATGTTCAGT TATACCGGTT TAAGTGCCGC TCAGGTTGAC CGACTACGTG AGGAGTTTGG TGTTTACCTG ATTACTAGCG GGCGCATGTG TGTCGCCGGG TTAAATACGG CAAATGTACA ACGTGTGGCA AAGGCGTTTG CTGCGGTGAT GTAA
|
Protein sequence | MFQKVDAYAG DPILTLMERF KEDPRSDKVN LSIGLYYNEE GIIPQLKAVA DAEARLNAQP HGASLYLPME GLNSYRHAIA PLLFGADHPV LQQQRVATIQ TLGGSGALKV GADFLKRYFP ESGVWVSDPT WENHVAIFAG AGFEVSTYPW YDEATNGVRF NDLLATLKTL PARSIVLLHP CCHNPTGADL TNNQWDAVIE ILKARELIPF LDIAYQGFGA GMEEDAYAIR AIASAGLPAL VSNSFSKIFS LYGERVGGLS VLCEDAEAAD RVLGQLKATV RRNYSSPPNF GAQVVAAVLN DEALKASWLA EVEEMRTRIL AMRQELVKVL STEMPERNFD YLLNQRGMFS YTGLSAAQVD RLREEFGVYL ITSGRMCVAG LNTANVQRVA KAFAAVM
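The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the permitted alternative initiation codons and is read as methionine at the start position. The following is a minimal sketch of that rule, not the pipeline that produced this record: the `translate` function and the `START_CODONS` set are hypothetical names, and the set lists only a subset of the start codons the table allows.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments and differs only in which start codons it permits.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a subset of table 11 initiation codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is always read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGTTTCAAA AAGTTGACGC CTACGCTGGC"))  # MFQKVDAYAG
```

The output matches the first ten residues of the protein sequence. An internal GTG is still read as valine, since the methionine substitution applies only at position 0.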
| |