Gene Information |
Locus tag | EcSMS35_1105 |
Symbol | |
ID | 6145080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1120428 |
End bp | 1121462 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615989 |
Product | inosine-uridine preferring nucleoside hydrolase |
Protein accession | YP_001743181 |
Protein GI | 170681581 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
|
|
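The coordinate and length fields above are related by simple arithmetic, sketched below. The sketch assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1120428
end_bp = 1121462

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced: No")
# and its last codon is a stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the Gene Length (1035 bp) and Protein Length (344 aa) fields.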
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGCCA AGGGCATGAA GTTCAACCCG AGAAAAAGTG AAAATGAATC GATGCGTATC ATTATTGACT GCGATCCGGG GAACGGCATT CCCGGCGCTA ATATCGACGA CGGTCTGGCG CTGGCGCTGG CCATTGCCGC ACCGCAAATC GCGCTGGAAA TGATCACCAC TGTTGCCGGA AATACGCCGG TAGACGTGGG ATATGCGGTC GCTAGAGATC TCATTACACA ATTGGATATT CCTGTTGCGG TCTACCGTGG TGCCTCGCGT GCGCTCTTGG AAGATCCGCA ACCCTGGCGT GAAAAACTGG ATCATGGTGT CGATCAGTTT GGTTTACGGC AACTCTGGTC GAATGTTCCT GCTCCAGCAT TGTGCCAGCA GGTAGAACCC CATGCGCCTG AAGCAATAGG TGAACTCATC TGTCGTAATC CAGGCGAAAT AACGTTGGTT GCTACCGGCC CCCTCACTAA TGTAGCCATC GCCCTGCAGC TTTACCCGCA GATTGTTCAC GCGGTCAAAA ATATCGTCGT TATGGGTGGC GTGTTCAATG TTCCAGGCTA CCTGAAAGAT ACAAATTTTG GTCTGGACCC TGAGGCGGCT CATGCGGTGC TCACCAGCGG TGCGCCAGTC ACGCTGGTCC CGATGGATGT GACAACGCAA ACCCAAATGC TTCACGCCGA TTTGGATCGT CTAGCAAAAA CAGAAAACGG GCTTAGCCGT TATTTGGCAC AAACCATTCG ACCATGGATT ACATACTCTA TGCAAACCCG CAATCTGCCT GGGTGTTGGA TCCACGATGT GTTAACCATT GCCTGGTTAC TGGATCCCTC TCTTGCAACA ACGGCTGAAG ATTATCTGGA TGTATCTCTG GAAGGCATTA CACGCGGAAT GACTTGTTGC TATGGACGTG ACACATTACG CCTCAATATT GGGATCCCTG AACCAAAAGG TGCACAGGTC ACAATTCTGC AGAGCATCGA TAACCCGCGG CTTATTTCGC TGATAGAGCA CTATATCCAG AACTACGGCG CGTAG
|
Protein sequence | MAAKGMKFNP RKSENESMRI IIDCDPGNGI PGANIDDGLA LALAIAAPQI ALEMITTVAG NTPVDVGYAV ARDLITQLDI PVAVYRGASR ALLEDPQPWR EKLDHGVDQF GLRQLWSNVP APALCQQVEP HAPEAIGELI CRNPGEITLV ATGPLTNVAI ALQLYPQIVH AVKNIVVMGG VFNVPGYLKD TNFGLDPEAA HAVLTSGAPV TLVPMDVTTQ TQMLHADLDR LAKTENGLSR YLAQTIRPWI TYSMQTRNLP GCWIHDVLTI AWLLDPSLAT TAEDYLDVSL EGITRGMTCC YGRDTLRLNI GIPEPKGAQV TILQSIDNPR LISLIEHYIQ NYGA
|
| |
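The protein sequence can be checked against the gene sequence by translating it with translation table 11, as in the minimal sketch below. It assumes that the gene sequence is given 5' to 3' on the coding strand and that the first codon is a valid start codon read as methionine (the gene starts with GTG, which would otherwise encode valine). The helper name `translate_cds` is hypothetical and not part of any library.

```python
# Amino-acid assignments for NCBI translation table 11, listed in the
# conventional T, C, A, G codon order. Table 11 uses the same assignments
# as the standard code; it differs in the set of permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Assumption: the first codon is a start codon and is emitted as 'M'.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[index:index + 3]]
        if amino_acid == "*":
            break  # stop codon: end of translation
        protein.append("M" if index == 0 else amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence in this record.
head = translate_cds("GTGGCTGCCA AGGGCATGAA G")
print(head)  # MAAKGMK
```

Passing the full Gene sequence field to `translate_cds` in the same way gives a string that can be compared with the Protein sequence field once the spaces are removed from both.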