Gene EcolC_2029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2029 
SymboltqsA 
ID6067906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2241545 
End bp2242579 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content50% 
IMG OID641601441 
Productputative transport protein 
Protein accessionYP_001725000 
Protein GI170020046 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.854402 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.642454 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGC CGATCATCAC GCTCAATGGC CTAAAAATCG TCATTATGTT GGGAATGCTG 
GTCATTATTC TCTGCGGTAT CCGTTTTGCC GCCGAGATCA TCGTGCCGTT TATTCTCGCA
TTATTTATTG CTGTTATTCT TAACCCGCTG GTGCAACACA TGGTCCGCTG GCGTGTGCCG
CGTGTACTGG CGGTGTCGAT TTTGATGACC ATCATCGTGA TGGCGATGGT GTTGCTATTA
GCTTATCTGG GTTCCGCGCT CAACGAGTTG ACGCGGACGT TACCGCAATA TCGCAACTCT
ATTATGACGC CGCTGCAAGC GCTTGAACCG TTGTTGCAAC GCGTAGGGAT TGACGTCTCA
GTTGACCAGC TGGCGCATTA TATTGATCCG AACGCGGCGA TGACGTTGCT CACCAACTTA
TTGACGCAGT TATCTAATGC CATGTCATCA ATATTTTTAT TGCTGCTGAC GGTGCTGTTT
ATGCTGCTCG AAGTGCCACA ATTGCCCGGA AAATTTCAGC AAATGATGGC GCGTCCGGTT
GAAGGGATGG CGGCGATTCA ACGTGCGATT GACAGTGTTT CTCATTATCT GGTGCTGAAA
ACAGCCATCA GCATCATCAC CGGCCTGGTC GCCTGGGCGA TGCTCGCCGC ACTCGATGTT
CGCTTCGCTT TTGTCTGGGG ATTGCTGGCC TTTGCGCTTA ATTACATCCC GAATATTGGT
TCAGTCCTCG CGGCAATCCC CCCTATCGCT CAGGTACTGG TGTTTAATGG CTTCTACGAA
GCGTTGCTGG TGCTGGCGGG ATATCTGCTG ATTAATCTGG TCTTCGGCAA TATTCTGGAG
CCGCGTATCA TGGGGCGTGG GCTGGGGCTT TCCACATTGG TGGTATTTTT GTCGTTGATT
TTTTGGGGAT GGTTGTTAGG ACCGGTGGGT ATGCTGCTTT CCGTGCCGTT GACAATTATT
GTCAAAATTG CGCTTGAACA AACAGCGGGA GGTCAAAGCA TCGCCGTTCT GTTAAGCGAT
CTCAATAAAG AGTGA
 
Protein sequence
MAKPIITLNG LKIVIMLGML VIILCGIRFA AEIIVPFILA LFIAVILNPL VQHMVRWRVP 
RVLAVSILMT IIVMAMVLLL AYLGSALNEL TRTLPQYRNS IMTPLQALEP LLQRVGIDVS
VDQLAHYIDP NAAMTLLTNL LTQLSNAMSS IFLLLLTVLF MLLEVPQLPG KFQQMMARPV
EGMAAIQRAI DSVSHYLVLK TAISIITGLV AWAMLAALDV RFAFVWGLLA FALNYIPNIG
SVLAAIPPIA QVLVFNGFYE ALLVLAGYLL INLVFGNILE PRIMGRGLGL STLVVFLSLI
FWGWLLGPVG MLLSVPLTII VKIALEQTAG GQSIAVLLSD LNKE