Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3547 |
Symbol | |
ID | 6064691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3878033 |
End bp | 3879403 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641602964 |
Product | aromatic amino acid transporter |
Protein accession | YP_001726488 |
Protein GI | 170021534 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1113] Gamma-aminobutyrate permease and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0288146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00579888 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAGGTC AACAGCACGG CGAGCAGCTA AAGCGCGGCC TTAAAAACCG CCATATTCAG CTTATCGCGC TGGGTGGCGC GATAGGGACC GGGTTATTCC TGGGTAGCGC CTCCGTAATA CAGTCCGCAG GGCCAGGGAT TATCCTGGGT TACGCCATTG CTGGTTTTAT CGCCTTTCTG ATCATGCGTC AGCTGGGTGA AATGGTGGTC GAAGAACCTG TCGCAGGCTC CTTTAGCCAC TTTGCTTATA AATACTGGGG CAGTTTTGCC GGTTTCGCCT CTGGCTGGAA CTACTGGGTA CTGTACGTTT TAGTTGCCAT GGCTGAGCTG ACTGCCGTGG GTAAATACAT TCAGTTCTGG TATCCGGAAA TCCCCACCTG GGTTTCTGCC GCCGTATTCT TTGTGGTGAT TAACGCCATC AACCTGACCA ACGTTAAAGT GTTTGGCGAG ATGGAGTTCT GGTTTGCCAT TATCAAAGTT ATCGCGGTGG TCGCGATGAT CATCTTCGGC GGCTGGCTGC TATTCAGTGG CAACGGCGGC CCGCAGGCGA CCGTTAGCAA CCTGTGGGAT CAGGGTGGTT TCCTGCCGCA CGGCTTCACC GGGCTGGTGA TGATGATGGC GATTATCATG TTCTCGTTCG GTGGTCTGGA ACTGGTGGGG ATCACCGCAG CAGAAGCTGA TAACCCGGAG CAAAGTATCC CGAAAGCGAC TAACCAGGTT ATCTACCGCA TCCTGATTTT CTATATTGGT TCGTTAGCCG TTCTGCTCTC ACTGATGCCG TGGACCCGCG TTACCGCCGA TACCAGTCCG TTTGTGCTGA TCTTCCACGA GTTAGGCGAT ACCTTTGTGG CGAATGCGCT GAACATCGTG GTACTGACTG CGGCGCTCTC CGTGTACAAC AGCTGCGTAT ATTGCAACAG CCGTATGCTG TTTGGTCTGG CACAACAGGG TAATGCGCCA AAAGCGCTGG CGTCTGTCGA TAAACGCGGC GTACCGGTTA ACACCATTCT GGTGTCTGCG CTGGTTACAG CATTGTGCGT ATTGATTAAC TATCTTGCTC CGGAATCCGC ATTTGGCCTG TTAATGGCAC TGGTGGTATC CGCACTGGTG ATCAACTGGG CGATGATCAG TCTGGCGCAT ATGAAGTTCC GTCGCGCCAA GCAGGAACAA GGCGTGGTAA CTCGCTTCCC TGCTCTGCTT TATCCGCTGG GTAACTGGAT CTGCCTGCTG TTTATGGCGG CGGTACTGGT GATTATGCTG ATGACCCCAG GAATGGCGAT TTCGGTATAC CTGATCCCGG TATGGCTGAT CGTGTTAGGT ATCGGCTATC TGTTTAAAGA GAAAACCGCC AAAGCCGTAA AAGCGCATTA A
|
Protein sequence | MEGQQHGEQL KRGLKNRHIQ LIALGGAIGT GLFLGSASVI QSAGPGIILG YAIAGFIAFL IMRQLGEMVV EEPVAGSFSH FAYKYWGSFA GFASGWNYWV LYVLVAMAEL TAVGKYIQFW YPEIPTWVSA AVFFVVINAI NLTNVKVFGE MEFWFAIIKV IAVVAMIIFG GWLLFSGNGG PQATVSNLWD QGGFLPHGFT GLVMMMAIIM FSFGGLELVG ITAAEADNPE QSIPKATNQV IYRILIFYIG SLAVLLSLMP WTRVTADTSP FVLIFHELGD TFVANALNIV VLTAALSVYN SCVYCNSRML FGLAQQGNAP KALASVDKRG VPVNTILVSA LVTALCVLIN YLAPESAFGL LMALVVSALV INWAMISLAH MKFRRAKQEQ GVVTRFPALL YPLGNWICLL FMAAVLVIML MTPGMAISVY LIPVWLIVLG IGYLFKEKTA KAVKAH
|
| |