Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2103 |
Symbol | |
ID | 3784674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2393683 |
End bp | 2394951 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812191 |
Product | major facilitator transporter |
Protein accession | YP_412788 |
Protein GI | 82703222 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.941752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAGG TAAAGAAGGT TAAGTCGGTG GAAACAGGCC GGATCGGCAG AACACTTACC CGCCTGCTCA TGCCGGTAGG AGTAGCCGAG GTTGCTTCCC CCTTACTGAT CGCTCGCGCA CTGCGTGGCT TTGCAGACGG TTATGTCGTT GTGCTTTTGC CTGCGTACCT GCTGGCGCTG GGGTTTGACC AGCTATATGT AGGCTATTTA AGTACGGCTA CCCTTGCGGG ATCCGCCCTG GCGACATTGG CGATCGGAGC AATGGGCTAC CGCTGGTCAA GCCGTCGGCT GTTGCTGTTC GCTGCCCTGT TAATGACAGC TACCGGCCTT GCTTTCGCGA GCGTTTCTTC GTTTTTGCCG CTCCTCATTG TCGCATTTGT TGGAACCCTT AATCCGAGTT CGGGCGATGT GAGCATGTTT TTACCCCTGG AGCATGCTCG TCTGGCACAG GGAGCAACTG GTAACCTACG CACCACTCTG TTTGCCCGTT ACACGCTTAC TGGTTCGTTA AGCGCGGCTG TAGGTGCCTT GGCCGCTGGC ATCCCCTCCT GGCTGATGCA ATGGGCAGGG CTACCCTTGA TTGATGGTTT GCGCACAATG TTTGTGCTGT ACGGCCTGCT CGGCGGATTG GTTTGGCTGC TCTATCGGGC TCTGCCAGCG GGCGACATCT CTACCAAGCA GTTACCAGCG CCGCTAGGTC CGTCACGCAG TATCGTTTTC AAGCTTGCGA TGTTGTTCAG CCTTGATGCA TTTGCCGGGG GATTGGTGGT GAATTCGCTA CTCACACTCT GGCTTTTTGA GCGATTTGGA CTCTCACCGG GAGAGGCAGG AACCTTTTTC TTCTGCGCAG GGCTTCTATC GGCCGGATCG CAGCTCGCTG CACCTGTAAT TGCCCGCAAG ATCGGACTGC TTAACACGAT GGTCTTCACT CACATTCCTG CAAACGTGTG TCTCATTTTT GCCGCAATTG CTCCAAACGT AGAAATCGCG CTGACGCTAT TGTTCGTGCG AAGTGCATTA TCGCAGATGG ATGTGCCCAC ACGAACTGCC TATGTGATGG CTGCAGTTAC GCCACCCGAG CGTGCTGCCG CGGCAAGCTT CACTACCGTA CCTCGCAGCC TTGCCTCAGT CCTGAGTCCA AGCTTGGCCG GCGCCTTGCT GGCCAGCGGG CTTCTTAGCG CTCCCTTAAT TGCTTGTGGA GCTTTAAAGA TTGCATATGA CTTGGCGTTG CTGGTGTCCT TCAGCCGCGT GAATGTTCCG CTGGATTAG
|
Protein sequence | MNQVKKVKSV ETGRIGRTLT RLLMPVGVAE VASPLLIARA LRGFADGYVV VLLPAYLLAL GFDQLYVGYL STATLAGSAL ATLAIGAMGY RWSSRRLLLF AALLMTATGL AFASVSSFLP LLIVAFVGTL NPSSGDVSMF LPLEHARLAQ GATGNLRTTL FARYTLTGSL SAAVGALAAG IPSWLMQWAG LPLIDGLRTM FVLYGLLGGL VWLLYRALPA GDISTKQLPA PLGPSRSIVF KLAMLFSLDA FAGGLVVNSL LTLWLFERFG LSPGEAGTFF FCAGLLSAGS QLAAPVIARK IGLLNTMVFT HIPANVCLIF AAIAPNVEIA LTLLFVRSAL SQMDVPTRTA YVMAAVTPPE RAAAASFTTV PRSLASVLSP SLAGALLASG LLSAPLIACG ALKIAYDLAL LVSFSRVNVP LD
|
| |