Gene Information |
Locus tag | B21_00110 |
Symbol | aroP |
ID | 8115388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 122983 |
End bp | 124356 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644846404 |
Product | hypothetical protein |
Protein accession | YP_002997977 |
Protein GI | 251783673 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1113] Gamma-aminobutyrate permease and related permeases |
TIGRFAM ID | |
| |
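The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence ends in TAA). The function and variable names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table.
start_bp = 122983
end_bp = 124356
gene_length_bp = 1374
protein_length_aa = 457


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record: 124356 - 122983 + 1 = 1374 and 1374 / 3 - 1 = 457.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions the reported gene length (1374 bp) and protein length (457 aa) agree with the coordinates.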
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00481643 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGAAG GTCAACAGCA CGGCGAGCAG CTAAAGCGCG GCCTTAAAAA CCGCCATATT CAGCTTATCG CGCTGGGTGG CGCGATAGGG ACCGGGTTAT TCCTGGGTAG CGCCTCCGTA ATACAGTCCG CAGGGCCAGG GATTATCCTG GGTTACGCCA TTGCTGGTTT TATCGCCTTT CTGATCATGC GTCAGCTGGG TGAAATGGTG GTCGAAGAAC CTGTCGCAGG CTCCTTTAGC CACTTTGCTT ATAAATACTG GGGCAGTTTT GCCGGTTTCG CCTCTGGCTG GAACTACTGG GTACTGTACG TTTTAGTTGC CATGGCTGAG CTGACTGCCG TGGGTAAATA CATTCAGTTC TGGTATCCGG AAATCCCCAC CTGGGTTTCT GCCGCCGTAT TCTTTGTGGT GATTAACGCC ATCAACCTGA CCAACGTTAA AGTGTTTGGC GAGATGGAGT TCTGGTTTGC CATTATCAAA GTTATCGCGG TGGTAGCGAT GATCATCTTC GGCGGCTGGC TGCTATTCAG TGGCAACGGC GGCCCGCAGG CGACCGTTAG CAACCTGTGG GATCAGGGTG GTTTCCTGCC GCACGGCTTC ACCGGGCTGG TGATGATGAT GGCGATTATC ATGTTCTCGT TCGGTGGTCT GGAACTGGTG GGGATCACCG CAGCAGAAGC TGATAACCCG GAGCAAAGTA TACCGAAAGC AACTAACCAG GTTATCTACC GCATCCTGAT TTTCTATATT GGTTCGTTAG CCGTTCTGCT CTCACTGATG CCGTGGACCC GCGTTACCGC CGATACCAGT CCGTTTGTGC TGATCTTCCA CGAGTTAGGC GATACCTTTG TGGCGAATGC GCTGAACATC GTGGTACTGA CTGCGGCGCT CTCCGTGTAC AACAGCTGCG TATATTGCAA CAGCCGTATG CTGTTTGGTC TGGCACAACA GGGTAATGCG CCAAAAGCGC TGGCGTCTGT CGATAAACGT GGTGTACCAG TAAATACCAT TCTGGTGTCT GCACTGGTAA CGGCGTTGTG CGTACTGATT AACTACCTTG CCCCAGAGTC CGCTTTCGGA CTGTTAATGG CGCTGGTGGT ATCTGCACTG GTAATCAACT GGGCGATGAT TAGCCTGGCG CATATGAAAT TCCGTCGCGC CAAGCAGGAA CAAGGCGTGG TAACTCGCTT CCCTGCTCTG CTTTATCCGC TGGGTAACTG GATCTGCCTG CTGTTTATGG CGGCGGTACT GGTGATTATG CTGATGACCC CAGGAATGGC GATTTCGGTA TACCTGATCC CGGTATGGCT GATCGTGTTA GGTATCGGCT ATCTGTTTAA AGAGAAAACC GCCAAAGCCG TAAAAGCGCA TTAA
|
Protein sequence | MMEGQQHGEQ LKRGLKNRHI QLIALGGAIG TGLFLGSASV IQSAGPGIIL GYAIAGFIAF LIMRQLGEMV VEEPVAGSFS HFAYKYWGSF AGFASGWNYW VLYVLVAMAE LTAVGKYIQF WYPEIPTWVS AAVFFVVINA INLTNVKVFG EMEFWFAIIK VIAVVAMIIF GGWLLFSGNG GPQATVSNLW DQGGFLPHGF TGLVMMMAII MFSFGGLELV GITAAEADNP EQSIPKATNQ VIYRILIFYI GSLAVLLSLM PWTRVTADTS PFVLIFHELG DTFVANALNI VVLTAALSVY NSCVYCNSRM LFGLAQQGNA PKALASVDKR GVPVNTILVS ALVTALCVLI NYLAPESAFG LLMALVVSAL VINWAMISLA HMKFRRAKQE QGVVTRFPAL LYPLGNWICL LFMAAVLVIM LMTPGMAISV YLIPVWLIVL GIGYLFKEKT AKAVKAH
|