Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4170 |
Symbol | nepI |
ID | 5589851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4157172 |
End bp | 4158527 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640927787 |
Product | ribonucleoside transporter |
Protein accession | YP_001465146 |
Protein GI | 157157594 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATAAGA CGTTCACGCC GCATCCGGTA ATCTTTGCCC TGTTCCGTTT CGATCTTAAC TCCAGCTCAA GCAACACCGT AAAAGTTGCA AAAACGTTAA ATGCGTCACA CATTTCAATG TTAAAGTGGT CGGCTTTTCC CCTGAAACAT GCCACGGGTA ACACCATGAG TGAATTTATT GCCGAAAACC GCGGCGCGGA TGCCATCACC CGACCGAACT GGTCAGCCGT TTTCTCGGTG GCGTTTTGTG TCGCCTGTCT GATTATCGTT GAGTTTTTGC CCGTCAGTTT GTTGACGCCT ATGGCCCAGG ATTTAGGCAT TTCGGAAGGG GTTGCCGGGC AATCGGTGAC CGTGACCGCC TTTGTGGCAA TGTTTGCCAG TTTGTTTATT ACCCAGACAA TTCAGGCTAC TGACCGCCGC TACGTTGTTA TTTTGTTTGC CGTTTTGCTG ACGCTCTCCT GCTTGCTGGT TTCCTTTGCT AACTCATTCA GTTTGCTTTT AATCGGTCGT GCCTGTCTGG GGCTGGCGCT GGGCGGGTTC TGGGCGATGT CGGCGTCGCT GACCATGCGT CTGGTGCCGC CGCGTACGGT GCCGAAGGCG CTGTCGGTGA TCTTCGGCGC GGTTTCTATT GCGCTGGTGA TTGCCGCGCC GTTGGGCAGT TTTTTAGGCG AGCTTATCGG TTGGCGCAAT GTCTTTAATG CGGCGGCGGT GATGGGCGTG CTGTGTATTT TCTGGATTAT CAAATCATTG CCTTCACTGC CAGGCGAACC CTCGCATCAG AAACAAAATA CTTTCCGCTT ATTACAACGT CCGGGTGTGA TGGCAGGGAT GATCGCCATC TTCATGTCTT TCGCCGGGCA GTTTGCTTTC TTCACATATA TTCGCCCGGT GTATATGAAC CTGGCGGGAT TCGGCGTGGA TGGCTTAACG CTGGTGCTGT TGAGTTTTGG CATTGCCAGC TTTATCGGTA CGTCGCTTTC GTCATTCATT CTTAAACGTT CGGTAAAACT GGCCTTAGCA GGCGCGCCGT TAATACTGGC TGTGAGTGCG TTGGTGCTGA CGTTGTGGGG AAGCGATAAA ATCGTTGCTA CCGGCGTGGC GATTATCTGG GGGCTAACTT TTGCATTGGT TCCCGTCGGC TGGTCAACGT GGATCACCCG CTCGCTGGCC GATCAGGCAG AAAAAGCCGG GTCTATTCAG GTGGCGGTTA TTCAGCTTGC TAATACCTGT GGCGCGGCAA TCGGCGGTTA TGCGCTGGAT AATATTGGTC TGACTTCGCC GTTGATGTTG TCCGGCACAT TGATGTTGCT GACTGCATTG TTGGTTACTG CAAAGGTGAA AATGAAGAAA TCCTGA
|
Protein sequence | MDKTFTPHPV IFALFRFDLN SSSSNTVKVA KTLNASHISM LKWSAFPLKH ATGNTMSEFI AENRGADAIT RPNWSAVFSV AFCVACLIIV EFLPVSLLTP MAQDLGISEG VAGQSVTVTA FVAMFASLFI TQTIQATDRR YVVILFAVLL TLSCLLVSFA NSFSLLLIGR ACLGLALGGF WAMSASLTMR LVPPRTVPKA LSVIFGAVSI ALVIAAPLGS FLGELIGWRN VFNAAAVMGV LCIFWIIKSL PSLPGEPSHQ KQNTFRLLQR PGVMAGMIAI FMSFAGQFAF FTYIRPVYMN LAGFGVDGLT LVLLSFGIAS FIGTSLSSFI LKRSVKLALA GAPLILAVSA LVLTLWGSDK IVATGVAIIW GLTFALVPVG WSTWITRSLA DQAEKAGSIQ VAVIQLANTC GAAIGGYALD NIGLTSPLML SGTLMLLTAL LVTAKVKMKK S
|
| |