Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2207 |
Symbol | |
ID | 5105427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2118331 |
End bp | 2119581 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640508100 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_001192269 |
Protein GI | 146304953 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000444837 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGATTGGGG CATTCAATCT AAAGAAATAC TTTCCTGTGA GAGGTTCTGC CCTAAAGAGC CTCTACGTTA AGGCTGTAGA TAATGTCTCC ATAAAAGTAA ACAGGGGTGA GGTCCTAGGA ATAGTTGGGG AAAGTGGTTC AGGCAAATCG ACCCTGGGAA GATTACTCAT ACGCTTACTA GAACCGACTG CCGGAGAGAT ACTCTTCGAT GCACCAGAAG AAGAATTGAA AAGATACGAG GATGCTCTCC TAACTAACGA CGAAAAAACC ATGAAGGAGA TATCTACCCG GTATTCCCTT CTATCGAAAA AGGGGTCAGA ACTCAGAAAA CTAAGAACAA GAATGAACAT GGTTTTTCAG GATCCCTATT CGTCAATAGA CCCCAGATAT AGGATTCTTG ACGTGATAAT GGAGCCCATG ATATCAACCG GATACCTTAA AGGGGAAGAA GCAAGGAGAA AGGTCTATGA CCTTCTAGAG GAAGTTGGTC TACCTAGAAA CTTTGCCATG AGATACCCGC ACGAACTATC GGGAGGACAA AGACAAAGAG TTGCTATAGC CCGTGCTCTT GCCACCGATC CTGACCTACT CATTCTTGAT GAACCCACAA GTGCCTTAGA TGTATCGGTT CAAGCTCAGA TACTGAACCT ACTCAACGAG TTAAGGAGGA AGAAAAACAT AACCATGGTC CTAATAACGC ATAACATCGC TGTAGTAAGT TACATGGCAA ATAGGGTTGC AGTCATGTAC TCTGGACGAC TTATGGAGAT TGGGGATAAG GAGAGCGTAT TAAACAACCC GAAGCACCCT TACACGATGG CACTTATCTC CTCTGTCCCA AGGCCTGAGC CAGGATCCAC TAGAAAAAGA ATAATTCTGA AGGGAGATCC ACCCAACCTA ATAAATCCAC CAAGAGGATG CGTCTTTCAC CCTAGATGTC CCATGGCATT TGAGAAGTGT GGATGGAGCG TCGACGAGAT AATGGAAGAT ATGAACTACC TTCTACAAGG AAAGTATTAT AACCTGTTTG AGAAGGCTAG CGTCATTATC GAGGGGACTA AGATGTACGT AAGAAACGCA AATATTGAAC TGCTAAGGAA AGTCATTAAT GAGGAAAAGG ACAAAATAAG ATCTCTTACC TCTATCGTAG ATGTGACGGA AGACGGAGAA GTGAGGATAT CTGAGTTCGA GGAACCTAAG CTCTTCAAGG AAAACGATAA CAGGGAAGTA GCTTGCCTCC TTTTCAAGTA G
|
Protein sequence | MIGAFNLKKY FPVRGSALKS LYVKAVDNVS IKVNRGEVLG IVGESGSGKS TLGRLLIRLL EPTAGEILFD APEEELKRYE DALLTNDEKT MKEISTRYSL LSKKGSELRK LRTRMNMVFQ DPYSSIDPRY RILDVIMEPM ISTGYLKGEE ARRKVYDLLE EVGLPRNFAM RYPHELSGGQ RQRVAIARAL ATDPDLLILD EPTSALDVSV QAQILNLLNE LRRKKNITMV LITHNIAVVS YMANRVAVMY SGRLMEIGDK ESVLNNPKHP YTMALISSVP RPEPGSTRKR IILKGDPPNL INPPRGCVFH PRCPMAFEKC GWSVDEIMED MNYLLQGKYY NLFEKASVII EGTKMYVRNA NIELLRKVIN EEKDKIRSLT SIVDVTEDGE VRISEFEEPK LFKENDNREV ACLLFK
|
| |