Gene ECH74115_3077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3077 
SymbolyegT 
ID6967103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2849148 
End bp2850425 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content51% 
IMG OID643386909 
Productnucleoside transporter 
Protein accessionYP_002271377 
Protein GI209398133 
COG category 
COG ID 
TIGRFAM ID[TIGR00889] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0170567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA CAACAAAGCT GTCGTTCATG ATGTTTGTTG AATGGTTTAT CTGGGGCGCG 
TGGTTTGTGC CATTGTGGCT GTGGTTAAGT AAAAGCGGTT TTAGTGCCGG AGAAATTGGC
TGGTCGTATG CCTGCACAGC CATTGCGGCG ATCCTGTCGC CAATTCTGGT TGGCTCCATC
ACTGACCGCT TTTTCTCGGC GCAGAAAGTG CTGGCGGTAT TGATGTTCAC TGGTGCGGCG
CTGATGTATT TCGCTGCGCA ACAGACCACT TTTGCCGGGT TCTTCCCGTT ACTGCTGGCC
TACTCGCTAA CCTATATGCC GACCATTGCG CTGACTAACA GCATCGCTTT TGCCAACGTG
CCGGATGTGG AGCGTGATTT CCCGCGCATT CGTGTGATGG GCACTATCGG CTGGATTGCC
TCTGGTCTGG CATGTGGTTT CTTGCCGCAA ATGCTGGGGT ATGCCGATAT CTCACCGACT
AACATCCCGC TGCTGATTAC CGCCGGAAGT TCTGCTCTGC TCGGTGTGTT TGCGTTTTTC
CTGCCCGACA CGCCGCCAAA AAGCACCGGC AAAATGGACA TTAAAGTCAT GCTCGGCCTG
GATGCGCTGA TCCTGCTGCG CGATAAAAAC TTCCTCGTCT TTTTCTTCTG TTCATTCCTG
TTTGCGATGC CACTGGCGTT CTATTACATC TTTGCCAACG GTTATCTGAC CGAAGTTGGC
ATGAAAAACG CCACCGGCTG GATGACGCTC GGCCAGTTCT CTGAAATATT CTTTATGCTG
GCATTGCCGT TTTTCACCAA ACGCTTTGGT ATCAAAAAGG TATTATTGCT TGGTCTGGTC
ACCGCTGCGA TCCGCTATGG CTTCTTTATT TACGGTAGTG CGGATGAATA TTTCACCTAC
GCGTTACTGT TCCTCGGCAT TTTGCTGCAC GGCGTAAGTT ACGATTTTTA CTACGTTACC
GCTTACATCT ATGTCGATAA AAAAGCCCCC GTGCATATGC GTACCGCTGC GCAGGGACTG
ATCACGCTCT GCTGCCAGGG CTTCGGCAGT TTGCTCGGCT ATCGTCTTGG CGGTGTGATG
ATGGAAAAGA TGTTCGCTTA TCAGGAACCG GTAAACGGAC TGACTTTCAA CTGGTCCGGG
ATGTGGACTT TTGGCGCGGT GATGATTGCC ATTATCGCCG TGCTGTTCAT GATCTTTTTC
CGCGAATCCG ACAACGAAAT TACGGCTATC AAGGTCGATG ATCGCGATAT TGCGTTGACA
CAAGGGGAAG TTAAATGA
 
Protein sequence
MKTTTKLSFM MFVEWFIWGA WFVPLWLWLS KSGFSAGEIG WSYACTAIAA ILSPILVGSI 
TDRFFSAQKV LAVLMFTGAA LMYFAAQQTT FAGFFPLLLA YSLTYMPTIA LTNSIAFANV
PDVERDFPRI RVMGTIGWIA SGLACGFLPQ MLGYADISPT NIPLLITAGS SALLGVFAFF
LPDTPPKSTG KMDIKVMLGL DALILLRDKN FLVFFFCSFL FAMPLAFYYI FANGYLTEVG
MKNATGWMTL GQFSEIFFML ALPFFTKRFG IKKVLLLGLV TAAIRYGFFI YGSADEYFTY
ALLFLGILLH GVSYDFYYVT AYIYVDKKAP VHMRTAAQGL ITLCCQGFGS LLGYRLGGVM
MEKMFAYQEP VNGLTFNWSG MWTFGAVMIA IIAVLFMIFF RESDNEITAI KVDDRDIALT
QGEVK