Gene EcolC_3871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3871 
Symbol 
ID6064620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4226662 
End bp4227918 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content52% 
IMG OID641603286 
Productinner membrane protein YjeH 
Protein accessionYP_001726802 
Protein GI170021848 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.915973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGAC TCAAACAAGA ACTGGGGCTG GCCCAGGGCA TTGGACTGCT ATCGACGTCA 
TTATTAGGCA CTGGCGTGTT TGCCGTTCCT GCGTTAGCTG CGCTGGTAGC GGGCAATAAC
AGCCTGTGGG CGTGGCCCGT TTTGATTATC TTAGTGTTCC CGATTGCGAT TGTGTTTGCG
ATTCTGGGTC GCCACTATCC CAGTGCAGGC GGCGTCGCGC ACTTCGTCGG TATGGCGTTT
GGTTCGCGGC TTGAGCGAGT CACCGGCTGG CTGTTTTTAT CGGTCATTCC CGTGGGTTTG
CCTGCCGCGC TACAAATTGC CGCCGGGTTC GGCCAGGCGA TGTTTGGCTG GCATAGCTGG
CAACTGTTGT TGGCAGAACT CGGTACGCTG GCGTTGGTGT GGTATATCGG TACTCGCGGT
GCCAGTTCCA GTGCAAATCT ACAAACCGTT ATTGCCGGAC TTATCGTCGC ACTGATTGTC
GCTATCTGGT GGGCGGGCGA TATCAAACCT GCGAGTATCC CCTTCCCCGC GCCAGGAAAT
ATCGAACTTA CCGGGTTATT TGCTGCGTTA TCAGTGATGT TCTGGTGTTT TGTCGGTCTG
GAGGCATTTG CCCATCTCGC CTCGGAATTT AAAAATCCAG AGCGTGATTT TCCTCGTGCT
TTGATGATTG GTCTGCTGCT GGCAGGATTA GTCTACTGGG GCTGTACGGT AGTCGTCTTA
CACTTCGACG CCTATGGTGA AAAAATGGCG GCGGCAGCAT CGCTTCCAAA AATTGTAGTG
CAGTTGTTCG GTGTAGGAGC GTTATGGATT GCCTGCGTGA TTGGCTATCT GGCCTGCTTT
GCCAGTCTCA ACATTTATAT ACAGAGCTTC GCCCGCCTGG TCTGGTCGCA GGCGCAACAT
AATCCTGACC ACTACCTGGC ACGCCTCTCT TCTCGCCATA TCCCGAATAA TGCCCTCAAT
GCGGTGCTCG GCTGCTGTGT GGTGAGCACT TTGGTGATTC ATGCTTTAGA GATCAATCTG
GACGCTCTTA TTATTTATGC CAATGGCATC TTTATTATGA TTTATCTGTT ATGCATGCTG
GCAGGCTGTA AATTATTGCA AGGACGTTAT CGACTACTGG CGGTGGTTGG CGGGCTGTTA
TGCGTTCTGT TACTGGCAAT GGTCGGCTGG AAAAGTCTCT ATGCGCTGAT CATGCTGGCG
GGGTTATGGC TGTTGCTGCC AAAACGAAAA ACGCCGGAAA ATGGCATAAC CACATAA
 
Protein sequence
MSGLKQELGL AQGIGLLSTS LLGTGVFAVP ALAALVAGNN SLWAWPVLII LVFPIAIVFA 
ILGRHYPSAG GVAHFVGMAF GSRLERVTGW LFLSVIPVGL PAALQIAAGF GQAMFGWHSW
QLLLAELGTL ALVWYIGTRG ASSSANLQTV IAGLIVALIV AIWWAGDIKP ASIPFPAPGN
IELTGLFAAL SVMFWCFVGL EAFAHLASEF KNPERDFPRA LMIGLLLAGL VYWGCTVVVL
HFDAYGEKMA AAASLPKIVV QLFGVGALWI ACVIGYLACF ASLNIYIQSF ARLVWSQAQH
NPDHYLARLS SRHIPNNALN AVLGCCVVST LVIHALEINL DALIIYANGI FIMIYLLCML
AGCKLLQGRY RLLAVVGGLL CVLLLAMVGW KSLYALIMLA GLWLLLPKRK TPENGITT