Gene B21_01231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01231 
SymboloppF 
ID8114271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1290762 
End bp1291766 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content52% 
IMG OID644847482 
Producthypothetical protein 
Protein accessionYP_002999055 
Protein GI251784751 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0162122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCTG TAACTGAAGG AAGAAAAGTC CTCCTTGAAA TCGCCGATCT TAAAGTGCAC 
TTTGAAATCA AAGATGGCAA ACAGTGGTTC TGGCAACCGC CGAAAACGCT CAAAGCCGTC
GATGGTGTCA CTCTTCGCCT GTATGAAGGG GAAACATTAG GTGTGGTAGG GGAATCGGGA
TGCGGTAAGT CCACCTTTGC TCGCGCCATC ATCGGTTTGG TCAAGGCGAC CGACGGTCAT
GTTGCCTGGT TAGGTAAAGA GTTGCTGGGC ATGAAGCCCG ATGAATGGCG TGCCGTTCGC
AGTGATATTC AGATGATTTT CCAGGATCCG TTGGCATCGC TAAACCCGCG TATGACCATC
GGCGAGATCA TCGCTGAACC ACTGCGTACT TATCATCCGA AAATGTCACG CCAGGAAGTT
CGCGAGCGCG TGAAGGCGAT GATGCTGAAA GTCGGGTTAT TGCCTAACCT GATTAACCGC
TATCCGCATG AGTTCTCCGG TGGGCAGTGC CAGCGTATCG GGATTGCTCG TGCTCTTATT
CTTGAACCGA AGCTGATTAT CTGCGATGAG CCGGTGTCGG CGCTGGACGT GTCAATTCAG
GCGCAGGTGG TCAACCTGCT CCAGCAGCTG CAACGTGAGA TGGGATTGTC ATTAATTTTT
ATCGCTCATG ACCTGGCCGT GGTAAAACAC ATTTCCGATC GTGTGTTGGT GATGTATCTC
GGCCATGCGG TAGAACTGGG GACCTATGAT GAGGTCTACC ACAATCCACT ACATCCTTAC
ACCAGGGCAT TGATGTCGGC AGTCCCCATA CCTGATCCGG ATCTGGAGAA GAACAAAACC
ATCCAGTTAC TGGAAGGGGA ATTACCGTCG CCGATCAACC CGCCTTCCGG TTGTGTTTTC
CGTACCCGTT GCCCGATTGC CGGTCCGGAG TGCGCCAAAA CACGTCCTGT TCTGGAGGGG
AGTTTCAGAC ACGCCGTTTC TTGCCTGAAA GTCGATCCGC TTTAA
 
Protein sequence
MNAVTEGRKV LLEIADLKVH FEIKDGKQWF WQPPKTLKAV DGVTLRLYEG ETLGVVGESG 
CGKSTFARAI IGLVKATDGH VAWLGKELLG MKPDEWRAVR SDIQMIFQDP LASLNPRMTI
GEIIAEPLRT YHPKMSRQEV RERVKAMMLK VGLLPNLINR YPHEFSGGQC QRIGIARALI
LEPKLIICDE PVSALDVSIQ AQVVNLLQQL QREMGLSLIF IAHDLAVVKH ISDRVLVMYL
GHAVELGTYD EVYHNPLHPY TRALMSAVPI PDPDLEKNKT IQLLEGELPS PINPPSGCVF
RTRCPIAGPE CAKTRPVLEG SFRHAVSCLK VDPL