Gene Information |
Locus tag | Msed_0426 |
Symbol | |
ID | 5105543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 377444 |
End bp | 378802 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506332 |
Product | major facilitator transporter |
Protein accession | YP_001190527 |
Protein GI | 146303211 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2223] Nitrate/nitrite transporter |
TIGRFAM ID | |
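The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 377444
end_bp = 378802
gene_length_bp = 1359
protein_length_aa = 452

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under those assumptions the three fields agree: 1359 bp is 453 codons, of which 452 encode amino acids.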
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCATCGC TGACGTGGAG AAACGTAATA GTCTCAGGGA TGGGAGTTCT CACTGACGGG TATAATCTTT ACTCGATCTC CCTCACTTCG TTCTTCATTC CCTCTTCTTT CACATTCTCC AGTGCAGAGC TTGGATTACT CGTAGCAGGG TCATATTACG GTGCTGCGAT CGCTGCTCTT CTGTTTGGTC TCTTAGCTGA CAGGATAGGA AGAAAGAGGA TTTACGGCTT CGATGTCCTG ATCATGGCCA TAGGGGCAGG ACTGCAGGCT TTCTCACAGT CCTATCTGGA GCTCTTTTTA GCTAGGTTAA TTCTTGGAGT GGGCATAGGG GCAGATTACG TCCTTTCACC TGTCATAGTA GCTGAGAATG CGGAGGGAAG GAACAGGGGA AAGGCAATGG TAGTGACCTT TGCCGTTATG TGGGGTCTCG GTGCAGTTAT TGCTGCGTTC GTGGAACAGA TGGGCTTACT CGCCCACCTT CCTGCAACCT TACTCTGGAG AGTAGTTCTG GGAATGGGTG CTATTCCAGC GATTTCAGTC TTCTTTCTGA GGAGGAAGAT ATACGAGACA ATGTTGTTCG TGTCCAGGGT CAACCCAGAG CACCAGGATG TGTCTAAGAT AGAGCAGGAA TTGGGGAAAC CCCTGCCCAA GGCTAAGGAC ACAACCCCAT TCTTGAGGAG GCTTTCCTCT TCTGCACTGT TAATCGTGGC AGCCTCTGTC CTCTGGCTAC TTTACGACAT GTACTCCTCG ACATTTGCCA TTTTCGGCCC CATCACTATA GCCTCAAATC TGGGACTCTC TCCCATAGAG TTCACGTATG TCGCGCAGTT CTTTGCGGGA ATTCCGGGAC AGTTAATATG CATCTATCTT GTGGACAAAA TAGGGAGAAA GCCCTTAATC GTGATAGGCT ACGCTGGAGT GGCCCTATGG CTTTTCGCCT ATTCCCTTCT ACTGGAGGAT CCCAGAATCT TTGGACTTCC GGAGGCTCAA CTCTCCGTAT CCAAGTTAGT GGGAGAGGCC GCAATTCTCG GTTTCTCGTT TTACATGCTA AACTACCTCT TCTCGGCCAT AGGTCCGGCA TCAATCATAG GCTCGGCGAT GGTGACGCCT GAGCTAGTTC CCACTAAGGT TAGGGCAACT AGCCAGGCCA TAAGCGTCAG CGTCGACAGA TTGGCTACTG CCCTTAACAT AACTGCCTTC CCGTTACTTC TCTCGCATTA TGGACTAGGA GCTATGGTTG GATTTTATGC AGGAATAGCA CTAATTTCCA CCATAATAAC ACTCTTCGTC ATTCCCGAAA CAAAGGGACA GGAATTAGAG AAAGTTGTTA AGGAGAGAAA CGTGGGAGAG GGATTATAA
Protein sequence | MASLTWRNVI VSGMGVLTDG YNLYSISLTS FFIPSSFTFS SAELGLLVAG SYYGAAIAAL LFGLLADRIG RKRIYGFDVL IMAIGAGLQA FSQSYLELFL ARLILGVGIG ADYVLSPVIV AENAEGRNRG KAMVVTFAVM WGLGAVIAAF VEQMGLLAHL PATLLWRVVL GMGAIPAISV FFLRRKIYET MLFVSRVNPE HQDVSKIEQE LGKPLPKAKD TTPFLRRLSS SALLIVAASV LWLLYDMYSS TFAIFGPITI ASNLGLSPIE FTYVAQFFAG IPGQLICIYL VDKIGRKPLI VIGYAGVALW LFAYSLLLED PRIFGLPEAQ LSVSKLVGEA AILGFSFYML NYLFSAIGPA SIIGSAMVTP ELVPTKVRAT SQAISVSVDR LATALNITAF PLLLSHYGLG AMVGFYAGIA LISTIITLFV IPETKGQELE KVVKERNVGE GL
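The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table holds the standard assignments, which translation table 11 shares (table 11 differs from table 1 only in its permitted start codons, which this sketch ignores). Only the first three blocks and the last block of the gene sequence are used here to keep the example short.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(raw):
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and final block of the gene sequence in this record.
head = clean_sequence("ATGGCATCGC TGACGTGGAG AAACGTAATA")
tail = clean_sequence("GGATTATAA")

print(translate(head))  # MASLTWRNVI
print(translate(tail))  # GL
```

The two fragments reproduce the first ten and last two residues of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same kind of check against the GC content and Protein Length fields.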