Gene EcolC_2171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2171 
Symbol 
ID6066166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2382945 
End bp2383967 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content55% 
IMG OID641601578 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001725137 
Protein GI170020183 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTTCT GGAGTATTTT ACGCCAACGC TGCTGGGGGC TGGTGCTCGT GGTGGCGGGC 
GTCTGCGTGA TTACGTTTAT TATCTCGCAT CTGATCCCTG GCGATCCGGC GCGGTTACTG
GCGGGTGACC GCGCCAGCGA TGCTATCGTG GAAAATATTC GCCAGCAACT GGGACTGGAC
CAGCCACTGT ATGTACAGTT TTACCGCTAC GTCAGCGACC TGTTTCATGG TGACCTGGGA
ACATCCATTC GTACCGGGCG TCCGGTGCTG GAAGAGTTGC GTATATTTTT CCCGGCGACG
CTGGAACTGG CTTTTTGTGC CCTGCTGCTG GCACTCCTGA TTGGCATCCC GCTGGGCATA
CTCTCTGCAG TCTGGCGAAA TCGCTGGCTG GATCATCTGG TGCGAATAAT GGCCATTACC
GGAATCTCCA CACCTGCGTT CTGGCTTGGA CTGGGCGTCA TTGTGCTGTT TTATGGTCAT
CTGCAAATTC TTCCCGGCGG CGGAAGGCTT GATGACTGGC TGGATCCACC AACGCACGTT
ACCGGCTTTT ATCTGCTCGA TGCGCTGCTT GAAGGCAACG GTGAAGTCTT CTTCAATGCG
TTGCAACATC TCATCTTACC GGCATTAACG CTGGCGTTCG TTCACCTGGG AATTGTCGCT
CGCCAGATCC GCTCAGCGAT GCTGGAACAA TTGAGTGAAG ACTACATTCG TACCGCCCGG
GCCAGCGGCT TGCCCGGCTG GTATATCGTT TTATGTTATG CGCTACCCAA TGCGTTGATC
CCATCGATTA CCGTATTGGG TCTGGCGCTG GGCGATTTGT TGTATGGCGC AGTGCTCACC
GAAACCGTTT TTGCCTGGCC CGGAATGGGT GCATGGGTAG TAACATCAAT ACAGGCGCTC
GACTTCCCGG CAGTGATGGG CTTTGCCGTC GTGGTTTCAT TTGCTTATGT GCTGGTCAAC
CTGGTGGTGG ATTTGCTCTA TTTGTGGATT GATCCGCGTA TCGGACGTGG AGGTGGTGAA
TGA
 
Protein sequence
MTFWSILRQR CWGLVLVVAG VCVITFIISH LIPGDPARLL AGDRASDAIV ENIRQQLGLD 
QPLYVQFYRY VSDLFHGDLG TSIRTGRPVL EELRIFFPAT LELAFCALLL ALLIGIPLGI
LSAVWRNRWL DHLVRIMAIT GISTPAFWLG LGVIVLFYGH LQILPGGGRL DDWLDPPTHV
TGFYLLDALL EGNGEVFFNA LQHLILPALT LAFVHLGIVA RQIRSAMLEQ LSEDYIRTAR
ASGLPGWYIV LCYALPNALI PSITVLGLAL GDLLYGAVLT ETVFAWPGMG AWVVTSIQAL
DFPAVMGFAV VVSFAYVLVN LVVDLLYLWI DPRIGRGGGE