Gene EcolC_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0035 
Symbol 
ID6068479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp37576 
End bp38910 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content48% 
IMG OID641599439 
Productxanthine/uracil/vitamin C permease 
Protein accessionYP_001723049 
Protein GI170018095 
COG category[R] General function prediction only 
COG ID[COG2252] Permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATG ACAATACCGA TTACGTGAGT AATGAATCAG GGACGCTTTC GCGATTATTT 
AAACTACCTC AGCATGGGAC CACCGTCCGC ACAGAATTGA TTGCGGGGAT GACCACTTTT
TTAACCATGG TGTACATCGT TTTTGTGAAC CCGCAAATCC TCGGCGCGGC ACAAATGGAC
CCGAAAGTGG TGTTTGTTAC CACCTGTTTG ATTGCCGGTA TCGGCAGTAT TGCGATGGGG
ATATTTGCTA ACTTACCCGT GGCGCTGGCT CCGGCAATGG GGCTGAACGC CTTCTTTGCC
TTCGTGGTCG TGGGGGCGAT GGGCATCTCC TGGCAGACCG GGATGGGCGC AATATTCTGG
GGCGCAGTTG GACTATTTTT GCTCACGCTG TTTCGTATCC GGTACTGGAT GATCTCCAAC
ATTCCCTTAA GTTTACGTAT TGGTATCACC AGCGGAATTG GATTATTTAT CGCCTTAATG
GGATTAAAAA ATACTGGCGT TATTGTCGCC AATAAAGACA CGCTGGTGAT GATTGGCGAT
TTAAGTTCTC ACGGCGTGTT GTTAGGTATT TTAGGGTTTT TTATTATAAC CGTGTTGTCA
TCACGTCATT TTCATGCCGC GGTGCTGGTT TCTATTGTGG TGACGTCTTG CTGTGGATTA
TTTTTCGGTG ATGTTCATTT TAGCGGCGTC TATTCCATTC CGCCTGATAT TAGCGGCGTC
ATTGGTGAAG TAGATTTGAG CGGCGCGTTA ACACTTGAAC TCGCCGGTAT CATTTTCTCC
TTTATGCTGA TCAACCTATT TGATTCATCA GGAACATTAA TTGGTGTAAC TGATAAAGCG
GGCTTAATAG ATGGTAACGG TAAATTCCCC AATATGAATA AGGCGCTGTA TGTTGATAGC
GTCAGTTCGG TGGCGGGTGC GTTTATCGGC ACCTCGTCTG TTACTGCCTA TATTGAAAGT
ACTTCTGGTG TGGCAGTCGG TGGCCGCACG GGGCTGACTG CGGTTGTGGT TGGCGTTATG
TTCCTGTTGG TTATGTTCTT CTCACCGCTG GTGGCGATAG TTCCTCCTTA CGCAACCGCC
GGAGCGTTAA TCTTTGTTGG CGTGCTGATG ACTTCGAGCC TGGCGCGCGT TAACTGGGAT
GATTTTACCG AATCGGTGCC TGCGTTTATT ACCACGGTGA TGATGCCCTT TACTTTCTCG
ATCACCGAAG GGATTGCACT CGGCTTTATG TCGTACTGCA TCATGAAAGT ATGCACCGGG
CGCTGGCGCG ATCTGAACCT GTGTGTGGTG GTGGTCGCAG CTCTGTTTGC ACTGAAGATT
ATTCTGGTGG ATTAG
 
Protein sequence
MNNDNTDYVS NESGTLSRLF KLPQHGTTVR TELIAGMTTF LTMVYIVFVN PQILGAAQMD 
PKVVFVTTCL IAGIGSIAMG IFANLPVALA PAMGLNAFFA FVVVGAMGIS WQTGMGAIFW
GAVGLFLLTL FRIRYWMISN IPLSLRIGIT SGIGLFIALM GLKNTGVIVA NKDTLVMIGD
LSSHGVLLGI LGFFIITVLS SRHFHAAVLV SIVVTSCCGL FFGDVHFSGV YSIPPDISGV
IGEVDLSGAL TLELAGIIFS FMLINLFDSS GTLIGVTDKA GLIDGNGKFP NMNKALYVDS
VSSVAGAFIG TSSVTAYIES TSGVAVGGRT GLTAVVVGVM FLLVMFFSPL VAIVPPYATA
GALIFVGVLM TSSLARVNWD DFTESVPAFI TTVMMPFTFS ITEGIALGFM SYCIMKVCTG
RWRDLNLCVV VVAALFALKI ILVD