Gene EcSMS35_0966 details


Gene Information

Locus tag: EcSMS35_0966
Symbol: yegT
ID: 6145336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 976065
End bp: 977342
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 50%
IMG OID: 641615853
Product: nucleoside transporter
Protein accession: YP_001743045
Protein GI: 170680614
COG category:
COG ID:
TIGRFAM ID: [TIGR00889] nucleoside transporter
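
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS is unspliced and ends in a single stop codon. Both assumptions are consistent with the values listed here, but they are not stated by the record itself. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 976065
END_BP = 977342


def gene_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, ASSUMING an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, PROTEIN_LEN)  # 1278 425
```

Both results agree with the Gene Length (1278 bp) and Protein Length (425 aa) fields.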


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.998859
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAA CAGCAAAGCT GTCGTTCATG ATGTTTGTTG AATGGTTTAT CTGGGGCGCG 
TGGTTTGTGC CATTGTGGTT GTGGTTAAGT AAAAGCGGTT TTAGTGCCGG AGAAATTGGC
TGGTCGTATG CCTGTACCGC CATTGCGGCG ATCCTGTCGC CGATTCTGGT TGGCTCCATC
ACTGACCGCT TTTTCTCGGC ACAGAAAGTG CTGGCGGTAT TGATGTTCGC TGGTGCGGTG
CTGATGTATT TCGCTGCGCA ACAGACCACT TTTGCCGGCT TCTTCCCGTT ACTGCTGGCC
TACTCGCTAA CCTATATGCC GACCATTGCG CTGACTAACA GTATCGCTTT TGCCAACGTG
CCGGATGTGG AGCGTGATTT CCCGCGCATT CGTGTGATGG GCACTATCGG CTGGATTGCC
TCTGGTCTGG CATGTGGATT CTTGCCGCAA ATGCTGGGTT ATGCCGATAT CTCACCGACG
AACATCCCGC TGCTGATTAC TGCCGGAAGT TCTGCTCTGC TTGGTGTGTT TGCGTTTTTC
CTGCCCGACA CGCCACCAAA AAGCACCGGC AAAATGGATA TTAAAGTCAT GCTCGGCCTG
GATGCGCTGA TCCTGCTGCG CGATAAGAAC TTCCTCGTCT TTTTCTTCTG TTCATTCCTG
TTTGCGATGC CACTGGCGTT CTATTACATC TTTGCCAACG GTTATCTGAC CGAAGTTGGC
ATGAAAAACG CCACTGGCTG GATGACGCTC GGCCAGTTCT CTGAAATCTT CTTTATGCTG
GCATTGCCGT TTTTCACTAA ACGCTTTGGT ATCAAAAAGG TATTGTTGCT TGGTCTGGTC
ACCGCTGCGA TCCGCTATGG CTTCTTTATT TACGGTAGTG CGGATGAATA TTTCACCTAC
GCGTTACTGT TCCTCGGCAT TTTGCTGCAC GGCGTAAGTT ACGATTTTTA CTACGTTACC
GCTTACATCT ATGTCGATAA AAAAGCCCCC GTGCATATGC GTACCGCTGC GCAGGGGCTG
ATCACGCTCT GCTGCCAGGG CTTCGGCAGT TTGCTCGGCT ATCGTCTTGG CGGTGTGATG
ATGGAAAAGA TGTTCGCTTA TCAGGAACCG GTAAACGGAC TGACTTTCAA CTGGGCCGGG
ATGTGGACTT TCGGCGCGGT GATGATTGCC ATTATCGCCG TGCTGTTCAT GATTTTTTTC
CGCGAATCCG ACAACGAAAT TACGGCTATC AAGGTCGATG ATCGCGATAT TGCGTTGACA
CAAGGGGAAG TTAAATGA
 
Protein sequence
MKTTAKLSFM MFVEWFIWGA WFVPLWLWLS KSGFSAGEIG WSYACTAIAA ILSPILVGSI 
TDRFFSAQKV LAVLMFAGAV LMYFAAQQTT FAGFFPLLLA YSLTYMPTIA LTNSIAFANV
PDVERDFPRI RVMGTIGWIA SGLACGFLPQ MLGYADISPT NIPLLITAGS SALLGVFAFF
LPDTPPKSTG KMDIKVMLGL DALILLRDKN FLVFFFCSFL FAMPLAFYYI FANGYLTEVG
MKNATGWMTL GQFSEIFFML ALPFFTKRFG IKKVLLLGLV TAAIRYGFFI YGSADEYFTY
ALLFLGILLH GVSYDFYYVT AYIYVDKKAP VHMRTAAQGL ITLCCQGFGS LLGYRLGGVM
MEKMFAYQEP VNGLTFNWAG MWTFGAVMIA IIAVLFMIFF RESDNEITAI KVDDRDIALT
QGEVK
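
The relationship between the two sequences can be checked by translating the gene sequence codon by codon. The following is a minimal sketch, not the method used to produce this record. It builds the standard codon-to-amino-acid mapping, which translation table 11 shares for amino acid assignments. It ignores the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 18 bases of the gene sequence above.
FIRST = translate("ATGAAAACAA CAGCAAAGCT GTCGTTCATG")
LAST = translate("CAAGGGGAAG TTAAATGA")
print(FIRST, LAST)  # MKTTAKLSFM QGEVK
```

The two fragments reproduce the first ten and last five residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.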