Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0462 |
Symbol | |
ID | 5105458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 416383 |
End bp | 417741 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506368 |
Product | major facilitator transporter |
Protein accession | YP_001190563 |
Protein GI | 146303247 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2223] Nitrate/nitrite transporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.042885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATTAT GGACAGATGC CATGGAACCT GAAAGATTTG GAGTTGATTC TGTAAAGGCG GTCTCATCCC AGTTTGTGGG ATTCCTTCTC GACTCCTACG ATCTAACAAT GATTTTGAGT ATAGCTCCCA TCTTAGCCAA GGTTCTTCTA CCTCCCGAGT CCCCCCTGCT TGCTACCTTT AACATAATTC TGAGTTACTC CTTAACAATC ATCTTTAGAC CTCTTGGCTC AGCCATATTT GGAAACCTTG GCGATAAGAT TGGGAGGAGA GCTGACCTAA TCATCACAGT CCTTGGCCTA GGCTTGGTAA GTGCACTCAC TGCTGCCCTT CCCACATACT CGGAGGTGGG AATACTCTCC TTCGTTCTTT TTGTTCTCCT CAGGGTACTG GTTGGGATTT TCGCTGGCGG CGAGTACTCC GCGGGCCATC CCTTTGCCAT GGAGTGGACC CCTTTCAAGT GGAGAGGTTT GATAAGCGGA TTTGTACAGG GAGGTTTTTC CTTTGGAGCT GCCCTGGCAG CTGTAGTTGA GGGAGCCTTC GTGGGAATAT ACGGGTTAAA GGGAGTGGAG GACTTCGCAT GGAGATATGT TTTCCTTACT GCTCTAGCAC CCGCGGTCAT CGCCCTAGCT GTGAGGCTAT CAATGAAGGA GACTCCGGTG TTTCAGGACG TGAAGAACAG GAACATGGTC AGGAAAAGTC CGCTCACAGA TCTGTTTAGA AAGCCCTACA GGAGGGACTT CTTTCAGGTT ATGGTGTATA TGACTGGTAT GTTCTTTTAC GCCTATTCCC TATTTGCCTT CGTGCCAGCG ATTCTTGAAC ACGCTCCCTC GGTGTTCTCC CTTCAAGAGG CTGAGAGCAT CTACTCCTAC GGTACTTATG CAGCCTTTGC AGGTGCAGTG ACGTTTGGGG CCCTTTCCCA GTACCTAGGG AGGAGAAGAT TGACACTGAT TTGGGTCTTC ATTACCTTAA TTCTCTCTGT CCCTGTATAC TATCTTCTCT TTACCTCCGC GAAGTCAGGT AATGTGGTCG GAGCATCCTT AGCTTCCGTG TTGATAGGAA TAATAACTCA GGCTCCCTGG GGAGTAATAC CCATCTATCT CTCTGAGAGA TTCAAGGCCT CAATGAGAGC TTCAGGTGTA GGTTTTGGGT ATTCATCTGG AATATTTGTG GGAGGTTGGT TTAGCATATA CGTGGAACTT ATGCACGAAT ACCTATTCAA GGGAATTGAC ACTCCAGAGA ACGTTTGGTT CTCTACAGCA GCTTTGCTAA TCCTAGGTGC AATATTTGTG GGGATAGGAC AATACCTGGG TCCAGAAACC CTGGGAACGA GACTAACCGA AGAACCCCAA AAGGTATAA
|
Protein sequence | MILWTDAMEP ERFGVDSVKA VSSQFVGFLL DSYDLTMILS IAPILAKVLL PPESPLLATF NIILSYSLTI IFRPLGSAIF GNLGDKIGRR ADLIITVLGL GLVSALTAAL PTYSEVGILS FVLFVLLRVL VGIFAGGEYS AGHPFAMEWT PFKWRGLISG FVQGGFSFGA ALAAVVEGAF VGIYGLKGVE DFAWRYVFLT ALAPAVIALA VRLSMKETPV FQDVKNRNMV RKSPLTDLFR KPYRRDFFQV MVYMTGMFFY AYSLFAFVPA ILEHAPSVFS LQEAESIYSY GTYAAFAGAV TFGALSQYLG RRRLTLIWVF ITLILSVPVY YLLFTSAKSG NVVGASLASV LIGIITQAPW GVIPIYLSER FKASMRASGV GFGYSSGIFV GGWFSIYVEL MHEYLFKGID TPENVWFSTA ALLILGAIFV GIGQYLGPET LGTRLTEEPQ KV
|
| |