Gene Information |
Locus tag | EcSMS35_0621 |
Symbol | |
ID | 6143367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 634440 |
End bp | 635600 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615513 |
Product | putative aminotransferase |
Protein accession | YP_001742719 |
Protein GI | 170683545 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
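
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the stop codon. Both assumptions agree with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 634440, End bp 635600.
length_bp = gene_length(634440, 635600)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's values this reproduces the listed Gene Length (1161 bp) and Protein Length (386 aa).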
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAATA ACCCTCTGAT TCCACAAAGC AAACTTCCAC AACTTGGCAC CACTATTTTC ACCCAGATGA GTGCACTGGC GCAGCAACAC CAGGCGATTA ACCTGTCGCA AGGCTTTCCT GATTTTGATG GTCCGCGCTA TTTGCAGGAA CGGCTGGCGT ACCACGTTGC CCAGGGGGCA AACCAATACG CACCCATGAC CGGCGTGCAG GCATTACGCG AGGCGATTGC TCAGAAAACG GAACGGCTGT ATGGCTATCA ACCAGATGCC GACAGCGATA TCACCGTAAC AGCAGGGGCG ACGGAAGCGC TATACGCAGC GATTACTGCA TTGGTTCGCA ATGGCGATGA AGTGATTTGC TTTGATCCCA GCTATGACAG TTACGCGCCC GCCATCGCGC TTTCGGGGGG AATAGTGAAA CGTATTGCAC TGCAACCACC GCATTTTCGC GTGGACTGGC AGGAATTTGC CGCATTGTTA AGCGAGCGCA CCCGACTGGT GATCCTCAAC ACCCCGCATA ACCCCAGTGC AACTGTCTGG CAGCAGGCTG ATTTCTCCGC TCTGTGGCAG GCGATTGCCG GGCACGAGAT TTTTGTCATT AGCGATGAAG TCTACGAGCA CATCAATTTT TCACAACAGG GCCATGCCAG CGTGCTGGCG CATCCGCAAT TGTGTGAGCG AGCGGTGGCG GTGTCATCGT TTGGCAAGAC CTATCATATG ACCGGCTGGA AAGTGGGGTA TTGCGTTGCA CCTGCGCCCA TCAGCGCCGA GATCCGCAAA GTGCATCAGT ATCTGACCTT TTCGGTGAAT ACCCCGGCAC AGTTGGCGCT TGCCGATATG CTACGTGCAG AACCTGAGCA TTATCTTGCG TTACCGGACT TTTATCGCCA GAAGCGCGAT ATTCTGGTGA ATGCCTTAAA TGAAAGTCGG CTGGAGATTT TACCGTGCGA AGGTACATAC TTTTTGCTGG TGGATTACAG CGCCGTATCT ACGCTGGATG ATGTTGAGTT TTGCCAGTGG CTGACGCGGG AGCACGGCGT AGCGGCGATT CCGCTGTCGG TGTTTTGCGC CGATCCCTTC CCACATAAAC TGATTCGTCT CTGTTTTGCC AAGAAGGAAT CGACGTTGCT GGCAGCAGCA GAACGCCTGC GCCAGCTTTA G
|
Protein sequence | MTNNPLIPQS KLPQLGTTIF TQMSALAQQH QAINLSQGFP DFDGPRYLQE RLAYHVAQGA NQYAPMTGVQ ALREAIAQKT ERLYGYQPDA DSDITVTAGA TEALYAAITA LVRNGDEVIC FDPSYDSYAP AIALSGGIVK RIALQPPHFR VDWQEFAALL SERTRLVILN TPHNPSATVW QQADFSALWQ AIAGHEIFVI SDEVYEHINF SQQGHASVLA HPQLCERAVA VSSFGKTYHM TGWKVGYCVA PAPISAEIRK VHQYLTFSVN TPAQLALADM LRAEPEHYLA LPDFYRQKRD ILVNALNESR LEILPCEGTY FLLVDYSAVS TLDDVEFCQW LTREHGVAAI PLSVFCADPF PHKLIRLCFA KKESTLLAAA ERLRQL
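
The gene and protein sequences above are printed in blocks of 10 characters. As a minimal sketch, the following strips that spacing, computes the GC fraction, and translates the coding sequence. The codon table is the standard NCBI amino acid assignment, which translation table 11 shares. Table 11 differs only in the start codons it permits, and that does not matter here because this gene starts with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and are not part of any database tool.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGACAAATA ACCCTCTGAT TCCACAAAGC"))  # MTNNPLIPQS
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the listed protein sequence and GC content.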