Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4031 |
Symbol | |
ID | 6968495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3725106 |
End bp | 3726383 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643387794 |
Product | major facilitator family transporter |
Protein accession | YP_002272237 |
Protein GI | 209400723 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2223] Nitrate/nitrite transporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.80085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.402393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACACA ACTCATATCG CCGTTGGATA ACCCTCGCGA TAATTAGTTT TAGCGGCGGC GTTAGTTTCG ACCTGGCTTA TTTACGTTAT ATTTATCAAA TTCCCATGGC GAAATTTATG GGATTCAGCA ATACCGAGAT AGGTTTAATA ATGAGTACCT TTGGTATTGC GGCTATTATT CTTTATGCCC CCAGCGGCGT TATTGCCGAT AAATTTTCAC ACCGCAAAAT GATTACTTCC GCGATGATCA TTACCGGATT ACTGGGTCTG TTAATGGCAA CGTATCCACC GCTGTGGGTA ATGCTCTGTA TTCAGGTCGC CTTTGCGATA ACAACGATTT TAATGCTGTG GTCGGTGTCG ATTAAAGCCG CATCGTTGCT TGGCGATCAT AGCGAGCAAG GGAAAATTAT GGGCTGGATG GAAGGGCTGC GCGGCGTCGG TGTAATGTCG CTGGCAGTGT TTACCATGTG GGTTTTTTCT CGCTTTGCAC CGGATGACAG CACCAGCCTG AAAACGGTCA TTATCATCTA TAGTGTGGTT TACATCTTGT TGGGGATTCT GTGCTGGTTT TTTGTTAGCG ATAACAACAA CCTGCGCAGT GCCAATAACG AAGAAAAACA GTCATTCCAG CTTAGCGACA TCCTTGCCGT TTTGCGTATC AGCACCACCT GGTATTGCAG CATGGTGATT TTTGGCGTCT TCACCATCTA CGCCATTCTG AGTTACTCCA CCAACTATCT GACCGAAATG TATGGCATGT CGCTGGTGGC GGCGAGCTAC ATGGGGATTG TGATCAACAA AATCTTCCGC GCGCTGTGCG GCCCACTTGG CGGCATTATC ACCACCTACA GTAAAGTGAA ATCCCCTACC CGCGTGATCC AAATCCTTTC CGTACTCGGC CTGCTGGCGT TAACTGCCCT GCTCGTCACG AACTCTAACC CGCAATCGGT CGCGATGGGG ATTGGCCTGA TTTTACTGCT GGGATTCACC TGTTACGCCT CACGCGGGCT GTACTGGGCC TGCCCTGGCG AAGCGAGAAC ACCGTCTTAC ATTATGGGCA CCACGGTAGG TATTTGCTCG GTGATTGGAT TCCTGCCGGA TGTCTTCGTT TACCCAATTA TCGGCCACTG GCAAGACACC CAGCCCGCAG CAGAAGCCTA CCGCAATATG TGGCTGATGG GCATGGCGGC GCTTGCCATG GTGATTGTCT TTACCTTTTT GCTGTTCCAA AAAATTCGTA CTGCTGATAG CGCCCCCGCA ATGGCCAGCA GCAAGTAA
|
Protein sequence | MQHNSYRRWI TLAIISFSGG VSFDLAYLRY IYQIPMAKFM GFSNTEIGLI MSTFGIAAII LYAPSGVIAD KFSHRKMITS AMIITGLLGL LMATYPPLWV MLCIQVAFAI TTILMLWSVS IKAASLLGDH SEQGKIMGWM EGLRGVGVMS LAVFTMWVFS RFAPDDSTSL KTVIIIYSVV YILLGILCWF FVSDNNNLRS ANNEEKQSFQ LSDILAVLRI STTWYCSMVI FGVFTIYAIL SYSTNYLTEM YGMSLVAASY MGIVINKIFR ALCGPLGGII TTYSKVKSPT RVIQILSVLG LLALTALLVT NSNPQSVAMG IGLILLLGFT CYASRGLYWA CPGEARTPSY IMGTTVGICS VIGFLPDVFV YPIIGHWQDT QPAAEAYRNM WLMGMAALAM VIVFTFLLFQ KIRTADSAPA MASSK
|
| |