Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4025 |
Symbol | nepI |
ID | 6145324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4113194 |
End bp | 4114378 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618850 |
Product | ribonucleoside transporter |
Protein accession | YP_001745988 |
Protein GI | 170681625 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.84574 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAT TTATTGCCGA AAACCGCGGC GCGGATGTCA TCACCCGACC GAACTGGTCA GCCGTATTCT CGGTGGCGTT TTGCGTCGCC TGTCTGATTA TCGTTGAGTT TTTGCCCGTC AGTTTGTTGA CGCCAATGGC CCAGGATTTA GGCATTTCGG AAGGGGTTGC CGGGCAATCG GTGACCGTGA CCGCCTTTGT GGCAATGTTT GCCAGTTTGT TTATTACCCA GACCATTCAG GCCACTGACC GCCGCTATGT TGTTATTTTG TTTGCCGTTT TGCTGACGCT CTCCTGCTTG CTGGTTTCCT TTGCTAACTC ATTCAGTTTG CTTTTAATCG GTCGTGCCTG TCTGGGGCTG GCGCTGGGTG GGTTCTGGGC GATGTCGGCG TCGCTGACCA TGCGTCTGGT ACCGCCGCGT ACGGTGCCGA AGGCGCTGTC GGTGATCTTC GGCGCGGTTT CTATTGCGCT GGTGATTGCC GCACCGTTGG GCAGTTTTTT AGGCGAGCTT ATCGGTTGGC GCAATGTCTT TAATGCGGCG GCGGTGATGG GCGTGCTGTG TATTTTCTGG ATTATCAAAT CATTGCCTTC GCTGCCAGGC GAACCCTCGC ATCAGAAACA AAATACTTTC CGCTTATTAC AACGTCCGGG TGTGATGGCA GGGATGATCG CCATATTCAT GTCTTTCGCC GGGCAGTTTG CTTTCTTCAC GTATATTCGC CCGGTGTATA TGAACCTGGC GGGATTCGGC GTGGATGGCT TAACGCTGGT GCTGTTGAGT TTTGGTATCG CCAGCTTTAT TGGTACGTCG CTTTCGTCGT TCATTCTTAA ACGTTCGGTA AAACTGGCCT TAGCAGGCGC GCCGTTAATA CTGGCTGTGA GTGCGTTGGT GCTGACGTTG TGGGGAAGCG ATAAAATCGT TGCTACCGGC GTGGCGATTA TCTGGGGGCT AACTTTTGCA TTGGTTCCTG TCGGCTGGTC AACGTGGATC ACCCGCTCAC TGGCCGATCA GGCAGAAAAA GCCGGGTCTA TTCAGGTGGC GGTTATTCAG CTTGCTAATA CCTGTGGCGC GGCAATCGGC GGTTATGCGC TGGATAATAT TGGTCTGACT TCGCCGTTGA TGTTGTCCGG CACATTGATG TTGCTGACGG CGTTATTGGT CACCGCAAAG GTGAAAATGA AGTAA
|
Protein sequence | MSEFIAENRG ADVITRPNWS AVFSVAFCVA CLIIVEFLPV SLLTPMAQDL GISEGVAGQS VTVTAFVAMF ASLFITQTIQ ATDRRYVVIL FAVLLTLSCL LVSFANSFSL LLIGRACLGL ALGGFWAMSA SLTMRLVPPR TVPKALSVIF GAVSIALVIA APLGSFLGEL IGWRNVFNAA AVMGVLCIFW IIKSLPSLPG EPSHQKQNTF RLLQRPGVMA GMIAIFMSFA GQFAFFTYIR PVYMNLAGFG VDGLTLVLLS FGIASFIGTS LSSFILKRSV KLALAGAPLI LAVSALVLTL WGSDKIVATG VAIIWGLTFA LVPVGWSTWI TRSLADQAEK AGSIQVAVIQ LANTCGAAIG GYALDNIGLT SPLMLSGTLM LLTALLVTAK VKMK
|
| |