Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2914 |
Symbol | |
ID | 5595192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2917824 |
End bp | 2919101 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640922031 |
Product | major facilitator family transporter |
Protein accession | YP_001459542 |
Protein GI | 157162224 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2223] Nitrate/nitrite transporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.00716941 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACACA ACTCATATCG CCGTTGGATA ACCCTCGCGA TAATTAGTTT TAGCGGCGGC GTTAGTTTTG ACCTGGCTTA TTTACGTTAT ATTTATCAAA TTCCCATGGC GAAATTTATG GGATTCAGCA ATACCGAGAT AGGTTTAATA ATGAGTACCT TTAGTATTGC GGCTATTATT CTTTATGCCC CCAGCGGCGT TATTGCCGAT AAATTTTCAC ACCGCAAAAT GATTACTTCC GCGATGATCA TTACCGGATT ACTGGGTCTG TTAATGGCAA CGTATCCACC GCTGTGGGTA ATGCTCTGTA TTCAGGTCGC CTTTGCGATA ACGACGATTT TAATGCTGTG GTCGGTGTCG ATTAAAGCCG CCTCGTTGCT GGGCGATCAT AGTGAGCAAG GGAAAATTAT GGGCTGGATG GAAGGGCTGC GCGGCGTCGG TGTAATGTCG CTGGCGGTGT TTACCATGTG GGTCTTTTCT CGCTTTGCTC CGGATGACAG CGCCAGCCTG AAAACAGTCA TTATCATCTA CAGTGTGGTT TACATCTTGT TGGGGATTCT GTGCTGGTTT TTTGTTAGCG ATAACAACAA CCTGCGCAGT GCCAATAACG AAGAAAAACA GTCATTCCAG CTTAGCGACA TCCTTGCCGT TTTGCGCATC AGCACCACCT GGTATTGCAG CATGGTGATT TTTGGTGTCT TCACCATCTA CGCCATTCTG AGTTACTCCA CCAACTATCT GACCGAAATG TACGGCATGT CGCTGGTGGC GGCGAGCTAC ATGGGGATTG TGATCAACAA AATCTTCCGC GCGCTGTGCG GCCCACTTGG CGGCATTATC ACCACCTACA GTAAAGTGAA ATCCCCTACC CGCGTGATCC AAATCCTTTC CGTACTCGGC CTGCTGGCGT TAACTGCCCT GCTCGTCACG AACTCTAACC CGCAATCGGT CGCGATGGGG ATTGGCCTGA TTTTACTGCT GGGATTCACC TGTTACGCCT CACGCGGGCT GTACTGGGCC TGCCCTGGCG AAGCGAGAAC ACCGTCTTAC ATTATGGGCA CCACGGTAGG TATTTGTTCG GTGATTGGAT TCCTGCCGGA TGTCTTCGTT TACCCGATTA TCGGCCACTG GCAAGACACC CTGCCCGCAG CAGAAGCCTA CCGCAATATG TGGCTGATGG GAATGGCGGC GCTTGCCATG GTGATTGTCT TTACCTTTTT GCTGTTCCAA AAAATTCGTA CTGCTGATAG CGCCCCCGCA ATGGCTAGCA GCAAGTAA
|
Protein sequence | MQHNSYRRWI TLAIISFSGG VSFDLAYLRY IYQIPMAKFM GFSNTEIGLI MSTFSIAAII LYAPSGVIAD KFSHRKMITS AMIITGLLGL LMATYPPLWV MLCIQVAFAI TTILMLWSVS IKAASLLGDH SEQGKIMGWM EGLRGVGVMS LAVFTMWVFS RFAPDDSASL KTVIIIYSVV YILLGILCWF FVSDNNNLRS ANNEEKQSFQ LSDILAVLRI STTWYCSMVI FGVFTIYAIL SYSTNYLTEM YGMSLVAASY MGIVINKIFR ALCGPLGGII TTYSKVKSPT RVIQILSVLG LLALTALLVT NSNPQSVAMG IGLILLLGFT CYASRGLYWA CPGEARTPSY IMGTTVGICS VIGFLPDVFV YPIIGHWQDT LPAAEAYRNM WLMGMAALAM VIVFTFLLFQ KIRTADSAPA MASSK
|
| |