Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03537 |
Symbol | tnaB |
ID | 8112815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3777110 |
End bp | 3778357 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644849707 |
Product | hypothetical protein |
Protein accession | YP_003001280 |
Protein GI | 251786976 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | [TIGR00837] aromatic amino acid transport protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATC AAGCTGAAAA AAAGCACTCT GCATTTTGGG GTGTTATGGT TATAGCAGGT ACAGTAATTG GTGGAGGTAT GTTTGCTTTA CCTGTTGATC TTGCCGGTGC CTGGTTTTTC TGGGGTGCCT TTATCCTTAT CATTGCCTGG TTTTCAATGC TTCATTCCGG GTTATTGTTA TTAGAAGCAA ATTTAAATTA TCCTGTCGGC TCCAGTTTTA ACACCATCAC CAAAGATTTA ATCGGTAACA CCTGGAACAT TATCAGCGGT ATTACCGTTG CCTTCGTTCT CTATATCCTC ACTTATGCCT ATATCTCTGC TAATGGTGCG ATCATTAGTG AAACGATATC AATGAATTTG GGCTATCACG CTAATCCACG TATTGTCGGG ATCTGCACAG CCATTTTCGT TGCCAGCGTA TTGTGGATAA GTTCGTTAGC CGCCAGTCGC ATTACCTCAT TGTTCCTCGG GCTGAAGATT ATCTCCTTTG TGATCGTGTT TGGTTCTTTC TTCTTCCAGG TCGATTACTC CATCCTGCGC GATGCCACCA GCACCACTGC GGGAACGTCT TACTTCCCGT ATATCTTTAT GGCTTTGCCG GTGTGTCTGG CGTCATTTGG TTTCCACGGC AATATTCCCA GCCTGATTAT TTGCTATGGA AAACGCAAAG ATAAGTTAAT CAAAAGCGTG GTATTTGGTT CGCTGCTGGC GCTGGTGATT TATCTCTTCT GGCTCTATTG CACGATGGGG AATATTCCGC GCGAAAGCTT TAAGGCGATA ATCTCCTCAG GCGGCAACGT TGATTCGCTG GTGAAATCGT TCCTCGGCAC CAAACAGCAC GGCATTATCG AGTTTTGCCT GCTGGTGTTC TCTAACTTAG CTGTTGCCAG TTCGTTCTTT GGTGTCACGC TGGGGTTGTT CGATTATCTG GCGGACCTGT TTAAGATTGA TAACTCCCAC GGCGGGCGTT TCAAAACCGT GCTGTTAACC TTCCTGCCAC CTGCGTTGTT GTATCTGATC TTCCCGAACG GCTTTATTTA CGGGATCGGC GGTGCCGGGC TGTGCGCCAC CATCTGGGCG GTCATTATTC CCGCAGTGCT TGCAATCAAA GCTCGCAAGA AGTTTCCCAA TCAGATGTTC ACGGTCTGGG GCGGCAATCT TATTCCGGCG ATTGTCATTC TCTTTGGTAT AACCGTGATT TTGTGCTGGT TCGGCAACGT CTTTAACGTG TTACCTAAAT TTGGCTAA
|
Protein sequence | MTDQAEKKHS AFWGVMVIAG TVIGGGMFAL PVDLAGAWFF WGAFILIIAW FSMLHSGLLL LEANLNYPVG SSFNTITKDL IGNTWNIISG ITVAFVLYIL TYAYISANGA IISETISMNL GYHANPRIVG ICTAIFVASV LWISSLAASR ITSLFLGLKI ISFVIVFGSF FFQVDYSILR DATSTTAGTS YFPYIFMALP VCLASFGFHG NIPSLIICYG KRKDKLIKSV VFGSLLALVI YLFWLYCTMG NIPRESFKAI ISSGGNVDSL VKSFLGTKQH GIIEFCLLVF SNLAVASSFF GVTLGLFDYL ADLFKIDNSH GGRFKTVLLT FLPPALLYLI FPNGFIYGIG GAGLCATIWA VIIPAVLAIK ARKKFPNQMF TVWGGNLIPA IVILFGITVI LCWFGNVFNV LPKFG
|
| |