Gene EcolC_2381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2381 
Symbol 
ID6067523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2619859 
End bp2620863 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content52% 
IMG OID641601784 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001725343 
Protein GI170020389 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.643182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000143297 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGCTG TAACTGAAGG AAGAAAAGTC CTCCTTGAAA TCGCCGATCT TAAAGTGCAC 
TTTGAAATCA AAGATGGCAA ACAGTGGTTC TGGCAACCGC CGAAAACGCT CAAAGCCGTC
GATGGTGTCA CTCTTCGCCT GTATGAAGGG GAAACATTAG GTGTGGTAGG GGAATCGGGA
TGCGGTAAGT CCACCTTTGC TCGCGCCATC ATCGGTTTGG TCAAGGCGAC CGACGGTCAT
GTTGCCTGGT TAGGTAAAGA GTTGCTGGGC ATGAAGCCCG ATGAATGGCG TGCCGTTCGC
AGTGATATTC AGATGATTTT CCAGGATCCG TTGGCATCGC TAAACCCGCG TATGACCATC
GGCGAGATCA TCGCTGAACC ACTGCGTACT TATCATCCGA AAATGTCACG CCAGGAAGTT
CGCGAGCGCG TGAAGGCGAT GATGCTGAAA GTCGGGTTAT TGCCTAACCT GATTAACCGC
TATCCGCATG AGTGCTCCGG TGGGCAGTGC CAGCGTATCG GGATTGCTCG TGCTCTTATT
CTTGAACCGA AGCTGATTAT CTGCGATGAG CCGGTGTCGG CGCTGGACGT GTCAATTCAG
GCGCAGGTGG TCAACCTGCT CCAGCAGCTG CAACGTGAGA TGGGATTGTC ATTAATTTTT
ATCGCTCATG ACCTGGCCGT GGTAAAACAC ATTTCCGATC GTGTGTTGGT GATGTATCTC
GGCCATGCGG TAGAACTGGG GACCTATGAT GAGGTCTACC ACAATCCACT ACATCCTTAC
ACCAAGGCAT TGATGTCGGC AGTCCCCATA CCTGATCCGG ATCTGGAGAA GAACAAAACC
ATCCAGTTAC TGGAAGGGGA ATTACCGTCG CCGATCAACC CGCCTTCCGG TTGTGTTTTC
CGTACCCGTT GCCCGATTGC CGGTCCGGAG TGCGCCAAAA CACGTCCTGT TCTGGAGGGG
AGTTTCAGAC ACGCCGTTTC TTGCCTGAAA GTCGATCCGC TTTAA
 
Protein sequence
MNAVTEGRKV LLEIADLKVH FEIKDGKQWF WQPPKTLKAV DGVTLRLYEG ETLGVVGESG 
CGKSTFARAI IGLVKATDGH VAWLGKELLG MKPDEWRAVR SDIQMIFQDP LASLNPRMTI
GEIIAEPLRT YHPKMSRQEV RERVKAMMLK VGLLPNLINR YPHECSGGQC QRIGIARALI
LEPKLIICDE PVSALDVSIQ AQVVNLLQQL QREMGLSLIF IAHDLAVVKH ISDRVLVMYL
GHAVELGTYD EVYHNPLHPY TKALMSAVPI PDPDLEKNKT IQLLEGELPS PINPPSGCVF
RTRCPIAGPE CAKTRPVLEG SFRHAVSCLK VDPL