Gene Information |
Locus tag | Msed_1047 |
Symbol | |
ID | 5104429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 975701 |
End bp | 976768 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506943 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001191136 |
Protein GI | 146303820 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGGCTAG CGAAATTAAT AGGAAAACGT GTGGCTGGGG CGCTCCTAGT TATTTTTGGT GAGTTAATTC TCGTCTTTTT CTTGGTAAAC GTTGCGGCCC CGAATCCGGC TTCCATATGG GCTGGCCCAG AGGCTTCACC TGAACAAATA CAGATTATTA CCGAGCTTTA TCATCTTAAC TCCCCGTGGT ACGTTCAGTT CCTCTATTAT ATCAAGAACT TCTTCACAGG AAATTGGGGG ATTTCACCCC TTTATCAAAC TCCCGTGATA CAGCTCGTGG AGGAGTACCT TCCTGTTACG CTTGAGCTTG CGGTAATATC ACTCGTACTC AAGTTGATTA TTGAGATTCC TCTAGGTGTC CTATCTGGTC TCAAGCCAAA CGGGCTTCTG GACAATGCCA TCAGGATGAT CTACACGATA ACGAGAAGTG TCCCTCCCTT CTTCGTGGCA CTGGGTCTCC TCCTCGTGTT AGCCTATGAT GTACACGTAT TCCCCGCATC ATATCCTGTG GATCCAATCC TGGCCCTGAA GGAACCAAAG TTCGAGATAT ATGACCCATT TAACGGGAAG TATTACCCGT TCTGGCTCCT TGATAACATG CCCATCCTGA ATGCCCTTCT GGTGGGAGAT TTCAGTGCCT TTGCCTCTGC TCTGGACCAC GCGATTTTAC CGGCTCTCAG TCTCACCCTC TTCGGTTTCG GTGGAATAAC GAGGCTCTCA AGAAACTCCA TGATAGAGGC CCTCAACATG GATTACATAA AGACAGCGAG GGCGAAGGGT TTGAAGGAGA GGGTAATAAT ATTTCGTCAT GCCCTAAGGA ATTCCCTTCT CCCAACCATA ACTCTCTCCA GCGTAATATT CGCCGGTTCA ATTCAAGGGG CCTTAGTGGT GGAAACAATC TTTAACTATT ACGGGATGGG GTATTACCTA GCCCAAAGCC TACTTGACCT TGACACTCCT TCCTTACTAG CAGGGACCGT TGTGGTTACC ATAGTTGTAG TTATCTCTAA CCTTGTCGCA GATGTGTTGT ACAGTGTGGT AGATCCTAGG GTGAGGGAGA CAACATGA
Protein sequence | MGLAKLIGKR VAGALLVIFG ELILVFFLVN VAAPNPASIW AGPEASPEQI QIITELYHLN SPWYVQFLYY IKNFFTGNWG ISPLYQTPVI QLVEEYLPVT LELAVISLVL KLIIEIPLGV LSGLKPNGLL DNAIRMIYTI TRSVPPFFVA LGLLLVLAYD VHVFPASYPV DPILALKEPK FEIYDPFNGK YYPFWLLDNM PILNALLVGD FSAFASALDH AILPALSLTL FGFGGITRLS RNSMIEALNM DYIKTARAKG LKERVIIFRH ALRNSLLPTI TLSSVIFAGS IQGALVVETI FNYYGMGYYL AQSLLDLDTP SLLAGTVVVT IVVVISNLVA DVLYSVVDPR VRETT
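The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names (`span_length`, `expected_protein_length`, `gc_percent`) are hypothetical.

```python
# Field values copied from the record above.
START_BP = 975701
END_BP = 976768
GENE_LENGTH_BP = 1068
PROTEIN_LENGTH_AA = 355


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Consistency checks against the reported fields.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Passing the space-separated Gene sequence string from the record to `gc_percent` gives a value that can be compared against the reported GC content field, which is presumably rounded.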