Gene B21_03298 details


Gene Information

Locus tag: B21_03298
Symbol: yhiP
ID: 8113338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3502459
End bp: 3503928
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 52%
IMG OID: 644849475
Product: hypothetical protein
Protein accession: YP_003001048
Protein GI: 251786744
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3104] Dipeptide/tripeptide permease
TIGRFAM ID: [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial
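
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3502459
end_bp = 3503928


def gene_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1470 489
```

Under these assumptions the computed values agree with the Gene Length (1470 bp) and Protein Length (489 aa) fields.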


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.240113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACAA CAACACCCAT GGGGATGCTG CAGCAACCTC GCCCATTTTT CATGATCTTT 
TTTGTCGAGT TATGGGAGCG ATTCGGCTAC TACGGCGTGC AGGGCGTACT GGCGGTTTTC
TTCGTTAAAC AGCTTGGATT CTCGCAAGAG CAGGCTTTTG TCACTTTTGG TGCTTTTGCT
GCGCTGGTCT ATGGCCTCAT TTCCATTGGC GGCTATGTCG GCGACCACCT GCTGGGGACC
AAACGCACCA TTGTTCTCGG AGCACTTGTG CTGGCGATTG GCTACTTCAT GACCGGCATG
TCGCTACTTA AGCCTGACCT GATTTTCATC GCCCTGGGGA CTATCGCTGT CGGTAACGGC
CTGTTTAAAG CTAACCCAGC CAGCTTGCTT TCGAAGTGCT ATCCGCCGAA AGATCCGCGG
CTTGATGGCG CATTCACCCT GTTCTATATG TCGATCAATA TCGGCTCGTT GATAGCGTTA
TCGCTGGCCC CTGTGATCGC TGATAGATTC GGTTATTCAG TCACCTACAA CCTGTGCGGG
GCGGGGTTAA TTATCGCATT ACTGGTTTAC ATCGCCTGTC GTGGAATGGT GAAAGACATT
GGTTCTGAAC CCGACTTCAA GCCGATGAGC TTCAGCAAAC TGTTGTACGT ATTACTTGGC
AGCGTGGTGA TGATCTTCGT ATGTGCATGG CTGATGCACA ACGTAGAAGT CGCCAATCTG
GTGCTGATTG TTCTCTCCAT CGTCGTCACC ATCATCTTCT TTCGTCAGGC ATTCAAGCTG
GATAAAACCG GGCGCAATAA AATGTTTGTC GCCTTTGTCC TGATGCTCGA AGCGGTGGTG
TTTTACATTC TCTACGCCCA GATGCCAACA TCGCTGAACT TCTTTGCCAT CAACAACGTG
CATCATGAAA TTCTCGGTTT TTCCATCAAC CCGGTCAGCT TCCAGGCGCT TAACCCGTTC
TGGGTGGTAC TCGCCAGCCC AATACTGGCA GGCATTTACA CGCATCTGGG TAACAAAGGC
AAAGACCTCT CGATGCCGAT GAAATTTACT CTCGGCATGT TTATGTGCTC TCTGGGCTTT
TTGACGGCTG CAGCTGCGGG AATGTGGTTT GCGGATGCCC AGGGGCTGAC ATCGCCATGG
TTTATCGTGC TGGTGTACTT ATTCCAGAGC TTAGGTGAAC TGTTTATTAG CGCCCTTGGC
CTGGCGATGA TTGCTGCCCT GGTGCCGCAG CATTTGATGG GCTTTATTCT CGGGATGTGG
TTCCTGACGC AGGCTGCCGC GTTCTTGCTG GGCGGCTATG TGGCAACATT TACCGCGGTG
CCGGACAACA TTACCGATCC GCTTGAGACG TTGCCCGTCT ATACCAACGT GTTTGGTAAG
ATTGGTCTGG TCACACTGGG CGTTGCAGTA GTGATGCTGT TGATGGTGCC GTGGCTGAAA
CGCATGATTG CGACGCCGGA AAGCCATTAA
 
Protein sequence
MNTTTPMGML QQPRPFFMIF FVELWERFGY YGVQGVLAVF FVKQLGFSQE QAFVTFGAFA 
ALVYGLISIG GYVGDHLLGT KRTIVLGALV LAIGYFMTGM SLLKPDLIFI ALGTIAVGNG
LFKANPASLL SKCYPPKDPR LDGAFTLFYM SINIGSLIAL SLAPVIADRF GYSVTYNLCG
AGLIIALLVY IACRGMVKDI GSEPDFKPMS FSKLLYVLLG SVVMIFVCAW LMHNVEVANL
VLIVLSIVVT IIFFRQAFKL DKTGRNKMFV AFVLMLEAVV FYILYAQMPT SLNFFAINNV
HHEILGFSIN PVSFQALNPF WVVLASPILA GIYTHLGNKG KDLSMPMKFT LGMFMCSLGF
LTAAAAGMWF ADAQGLTSPW FIVLVYLFQS LGELFISALG LAMIAALVPQ HLMGFILGMW
FLTQAAAFLL GGYVATFTAV PDNITDPLET LPVYTNVFGK IGLVTLGVAV VMLLMVPWLK
RMIATPESH
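
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It is not part of the original record. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it ignores the alternative start codons of that table because this gene begins with ATG. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# Amino acid string for NCBI translation table 11 (same assignments as table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence shown above.
first_codons = "ATGAATACAA CAACACCCAT GGGGATGCTG"
print(translate(first_codons))  # MNTTTPMGML
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.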