Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03298 |
Symbol | yhiP |
ID | 8113338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3502459 |
End bp | 3503928 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644849475 |
Product | hypothetical protein |
Protein accession | YP_003001048 |
Protein GI | 251786744 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.240113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACAA CAACACCCAT GGGGATGCTG CAGCAACCTC GCCCATTTTT CATGATCTTT TTTGTCGAGT TATGGGAGCG ATTCGGCTAC TACGGCGTGC AGGGCGTACT GGCGGTTTTC TTCGTTAAAC AGCTTGGATT CTCGCAAGAG CAGGCTTTTG TCACTTTTGG TGCTTTTGCT GCGCTGGTCT ATGGCCTCAT TTCCATTGGC GGCTATGTCG GCGACCACCT GCTGGGGACC AAACGCACCA TTGTTCTCGG AGCACTTGTG CTGGCGATTG GCTACTTCAT GACCGGCATG TCGCTACTTA AGCCTGACCT GATTTTCATC GCCCTGGGGA CTATCGCTGT CGGTAACGGC CTGTTTAAAG CTAACCCAGC CAGCTTGCTT TCGAAGTGCT ATCCGCCGAA AGATCCGCGG CTTGATGGCG CATTCACCCT GTTCTATATG TCGATCAATA TCGGCTCGTT GATAGCGTTA TCGCTGGCCC CTGTGATCGC TGATAGATTC GGTTATTCAG TCACCTACAA CCTGTGCGGG GCGGGGTTAA TTATCGCATT ACTGGTTTAC ATCGCCTGTC GTGGAATGGT GAAAGACATT GGTTCTGAAC CCGACTTCAA GCCGATGAGC TTCAGCAAAC TGTTGTACGT ATTACTTGGC AGCGTGGTGA TGATCTTCGT ATGTGCATGG CTGATGCACA ACGTAGAAGT CGCCAATCTG GTGCTGATTG TTCTCTCCAT CGTCGTCACC ATCATCTTCT TTCGTCAGGC ATTCAAGCTG GATAAAACCG GGCGCAATAA AATGTTTGTC GCCTTTGTCC TGATGCTCGA AGCGGTGGTG TTTTACATTC TCTACGCCCA GATGCCAACA TCGCTGAACT TCTTTGCCAT CAACAACGTG CATCATGAAA TTCTCGGTTT TTCCATCAAC CCGGTCAGCT TCCAGGCGCT TAACCCGTTC TGGGTGGTAC TCGCCAGCCC AATACTGGCA GGCATTTACA CGCATCTGGG TAACAAAGGC AAAGACCTCT CGATGCCGAT GAAATTTACT CTCGGCATGT TTATGTGCTC TCTGGGCTTT TTGACGGCTG CAGCTGCGGG AATGTGGTTT GCGGATGCCC AGGGGCTGAC ATCGCCATGG TTTATCGTGC TGGTGTACTT ATTCCAGAGC TTAGGTGAAC TGTTTATTAG CGCCCTTGGC CTGGCGATGA TTGCTGCCCT GGTGCCGCAG CATTTGATGG GCTTTATTCT CGGGATGTGG TTCCTGACGC AGGCTGCCGC GTTCTTGCTG GGCGGCTATG TGGCAACATT TACCGCGGTG CCGGACAACA TTACCGATCC GCTTGAGACG TTGCCCGTCT ATACCAACGT GTTTGGTAAG ATTGGTCTGG TCACACTGGG CGTTGCAGTA GTGATGCTGT TGATGGTGCC GTGGCTGAAA CGCATGATTG CGACGCCGGA AAGCCATTAA
|
Protein sequence | MNTTTPMGML QQPRPFFMIF FVELWERFGY YGVQGVLAVF FVKQLGFSQE QAFVTFGAFA ALVYGLISIG GYVGDHLLGT KRTIVLGALV LAIGYFMTGM SLLKPDLIFI ALGTIAVGNG LFKANPASLL SKCYPPKDPR LDGAFTLFYM SINIGSLIAL SLAPVIADRF GYSVTYNLCG AGLIIALLVY IACRGMVKDI GSEPDFKPMS FSKLLYVLLG SVVMIFVCAW LMHNVEVANL VLIVLSIVVT IIFFRQAFKL DKTGRNKMFV AFVLMLEAVV FYILYAQMPT SLNFFAINNV HHEILGFSIN PVSFQALNPF WVVLASPILA GIYTHLGNKG KDLSMPMKFT LGMFMCSLGF LTAAAAGMWF ADAQGLTSPW FIVLVYLFQS LGELFISALG LAMIAALVPQ HLMGFILGMW FLTQAAAFLL GGYVATFTAV PDNITDPLET LPVYTNVFGK IGLVTLGVAV VMLLMVPWLK RMIATPESH
|
| |