Gene EcSMS35_1105 details


Gene Information

Locus tag: EcSMS35_1105
Symbol:
ID: 6145080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1120428
End bp: 1121462
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 52%
IMG OID: 641615989
Product: inosine-uridine preferring nucleoside hydrolase
Protein accession: YP_001743181
Protein GI: 170681581
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1957] Inosine-uridine nucleoside N-ribohydrolase
TIGRFAM ID:
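
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1120428
end_bp = 1121462
reported_gene_length = 1035     # bp
reported_protein_length = 344   # aa

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the final codon is the stop
# codon and contributes no residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.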


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGCCA AGGGCATGAA GTTCAACCCG AGAAAAAGTG AAAATGAATC GATGCGTATC 
ATTATTGACT GCGATCCGGG GAACGGCATT CCCGGCGCTA ATATCGACGA CGGTCTGGCG
CTGGCGCTGG CCATTGCCGC ACCGCAAATC GCGCTGGAAA TGATCACCAC TGTTGCCGGA
AATACGCCGG TAGACGTGGG ATATGCGGTC GCTAGAGATC TCATTACACA ATTGGATATT
CCTGTTGCGG TCTACCGTGG TGCCTCGCGT GCGCTCTTGG AAGATCCGCA ACCCTGGCGT
GAAAAACTGG ATCATGGTGT CGATCAGTTT GGTTTACGGC AACTCTGGTC GAATGTTCCT
GCTCCAGCAT TGTGCCAGCA GGTAGAACCC CATGCGCCTG AAGCAATAGG TGAACTCATC
TGTCGTAATC CAGGCGAAAT AACGTTGGTT GCTACCGGCC CCCTCACTAA TGTAGCCATC
GCCCTGCAGC TTTACCCGCA GATTGTTCAC GCGGTCAAAA ATATCGTCGT TATGGGTGGC
GTGTTCAATG TTCCAGGCTA CCTGAAAGAT ACAAATTTTG GTCTGGACCC TGAGGCGGCT
CATGCGGTGC TCACCAGCGG TGCGCCAGTC ACGCTGGTCC CGATGGATGT GACAACGCAA
ACCCAAATGC TTCACGCCGA TTTGGATCGT CTAGCAAAAA CAGAAAACGG GCTTAGCCGT
TATTTGGCAC AAACCATTCG ACCATGGATT ACATACTCTA TGCAAACCCG CAATCTGCCT
GGGTGTTGGA TCCACGATGT GTTAACCATT GCCTGGTTAC TGGATCCCTC TCTTGCAACA
ACGGCTGAAG ATTATCTGGA TGTATCTCTG GAAGGCATTA CACGCGGAAT GACTTGTTGC
TATGGACGTG ACACATTACG CCTCAATATT GGGATCCCTG AACCAAAAGG TGCACAGGTC
ACAATTCTGC AGAGCATCGA TAACCCGCGG CTTATTTCGC TGATAGAGCA CTATATCCAG
AACTACGGCG CGTAG
 
Protein sequence
MAAKGMKFNP RKSENESMRI IIDCDPGNGI PGANIDDGLA LALAIAAPQI ALEMITTVAG 
NTPVDVGYAV ARDLITQLDI PVAVYRGASR ALLEDPQPWR EKLDHGVDQF GLRQLWSNVP
APALCQQVEP HAPEAIGELI CRNPGEITLV ATGPLTNVAI ALQLYPQIVH AVKNIVVMGG
VFNVPGYLKD TNFGLDPEAA HAVLTSGAPV TLVPMDVTTQ TQMLHADLDR LAKTENGLSR
YLAQTIRPWI TYSMQTRNLP GCWIHDVLTI AWLLDPSLAT TAEDYLDVSL EGITRGMTCC
YGRDTLRLNI GIPEPKGAQV TILQSIDNPR LISLIEHYIQ NYGA
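
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the pipeline that produced the record. It assumes the gene sequence is given in coding orientation, uses the codon assignments of translation table 11 (identical to the standard code for internal codons), and decodes any table 11 start codon, such as the GTG that opens this gene, as methionine. The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(dna):
    """Translate a CDS, reading the start codon as M and stopping at a stop codon."""
    dna = clean(dna)
    if dna[:3] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    residues = ["M"]
    for i in range(3, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 60 bases of the gene sequence above.
first_line = "GTGGCTGCCA AGGGCATGAA GTTCAACCCG AGAAAAAGTG AAAATGAATC GATGCGTATC"
print(translate_cds(first_line))  # MAAKGMKFNPRKSENESMRI
print(round(gc_fraction(first_line), 2))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions would allow the complete protein and the reported GC content to be compared in the same way.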