Gene EcolC_0336 details


Gene Information

Locus tag: EcolC_0336
Symbol:
ID: 6065652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 387593
End bp: 388897
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 53%
IMG OID: 641599735
Product: putative transport system permease protein
Protein accession: YP_001723341
Protein GI: 170018387
COG category:
COG ID:
TIGRFAM ID:
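
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 387593
end_bp = 388897
gene_length_bp = 1305
protein_length_aa = 434

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under those assumptions, a mismatch in any of the three checks would suggest either a different coordinate convention or an error in the record.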


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.354988
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTGT ATATTCAGAT TATCGTGGTG GCGTGCCTGA CGGGTATGAC ATCGCTTCTG 
GCGCATCGCT CGGCGGCTGT TTTTCATGAC GGCATACGCC CGATCCTGCC GCAACTGATT
GAAGGCTATA TGAACCGTCG CGAGGCGGGG AGTATCGCTT TTGGTCTGAG CATTGGTTTT
GTGGCCTCGG TGGGGATCTC TTTTACCCTG AAAACCGGGC TGCTAAACGC ATGGTTACTC
TTTCTTCCTA CCGATATCCT CGGCGTACTG GCGATAAACA GCCTGATGGC GTTTGGTCTT
GGCGCTATCT GGGGCGTGTT GATCCTTACT TGCCTGTTGC CAGTAAACCA GCTGCTGACC
GCGCTGCCGG TGGATGTATT AGGTAGCCTC GGGGAATTAA GCTCGCCGGT GGTTTCTGCT
TTTGCACTCT TCCCGTTGGT GGCGATTTTC TACCAGTTTG GCTGGAAGCA AAGTCTGGTC
GCCGCCGTGG TTGTTCTGAT GGCCCGTGTG GTAGTCGTGC GCTATTTCCC ACATCTTAAC
CCTGAATCCA TCGAAATCTT TATTGGCATG GTGATGCTGC TGGGAATCGC GATAACTCAC
GACCTGCGTC ATCGTGATGA AAATGACATC GATGCCAGCG GGCTTTCGGT GTTTGAAGAG
CGCACGTCGC GGATTATCAA AAACTTACCC TATATCGCCA TCGTGGGAGC ATTGATTGCC
GCCGTTGCCA GTATGAAGAT TTTCGCCGGC AGTGAAGTGT CGATCTTCAC TCTGGAGAAA
GCGTACTCCG CAGGCGTAAC GCCGGAACAA TCGCAAACGC TGATCAACCA GGCTGCTCTG
GCGGAGTTTA TGCGCGGACT GGGTTTTGTG CCGTTGATTG CCACCACCGC GTTAGCCACC
GGCGTATATG CAGTTGCGGG CTTTACCTTT GTTTATGCGG TGGGCTATCT CTCGCCGAAT
CCGATGGTTG CAGCGGTATT AGGCGCAGTG GTTATTTCGG CGGAAGTCTT GCTGCTTCGT
TCGATCGGCA AATGGCTGGG ACGCTACCCG TCGGTGCGTA ATGCGTCGGA TAACATCCGT
AACGCCATGA ATATGCTGAT GGAAGTGGCG CTGCTGGTCG GTTCGATTTT CGCAGCAATT
AAGATGGCGG GTTATACCGG ATTCTCTATC GCGGTTGCCA TTTACTTCCT CAACGAATCC
CTGGGCCGTC CGGTACAGAA AATGGCGGCA CCGGTCGTGG CAGTAATGAT CACCGGTATT
CTGCTGAATG TTCTTTACTG GCTTGGCCTG TTCGTTCCGG CTTAA
 
Protein sequence
MDLYIQIIVV ACLTGMTSLL AHRSAAVFHD GIRPILPQLI EGYMNRREAG SIAFGLSIGF 
VASVGISFTL KTGLLNAWLL FLPTDILGVL AINSLMAFGL GAIWGVLILT CLLPVNQLLT
ALPVDVLGSL GELSSPVVSA FALFPLVAIF YQFGWKQSLV AAVVVLMARV VVVRYFPHLN
PESIEIFIGM VMLLGIAITH DLRHRDENDI DASGLSVFEE RTSRIIKNLP YIAIVGALIA
AVASMKIFAG SEVSIFTLEK AYSAGVTPEQ SQTLINQAAL AEFMRGLGFV PLIATTALAT
GVYAVAGFTF VYAVGYLSPN PMVAAVLGAV VISAEVLLLR SIGKWLGRYP SVRNASDNIR
NAMNMLMEVA LLVGSIFAAI KMAGYTGFSI AVAIYFLNES LGRPVQKMAA PVVAVMITGI
LLNVLYWLGL FVPA