Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2205 |
Symbol | |
ID | 5105425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2116077 |
End bp | 2116973 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640508098 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001192267 |
Protein GI | 146304951 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000404116 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAAAAC AGATCCAAGA GGAAGAGGAG AAGAGATTTT ACAACTTGAA ATTATCATTA AAGGTCTTCT TTTCCAATAG GACCGCTGTA GCTGGCTTAA TTATCTTCCT AGGTTATGTT GCTGACGCTC TACTGATGCA GTTTTACCCT GAGATTGTGG GCGTTAGAAA TCCAAATACC TTGATTTACA ACTTCATTAA TCCTGTTCCC CAACCGCCTT CGGCTAAGTA TCCCCTTGGA ACAACTTATC CTGGTGTAAA TCTACTGCAG GCAATAATGG AGGCCATAAG GATAGATCTA GGATTCTCGT TGCTGATTGT GGTTAGCGGT GCATTAATCG GAGCAGTCAT AGGAGTACTA GCGGCGTACG TGGGAGGGTA CTTGGATGAG GTTCTAATGA GAATCACTGA TATTTTCTTT AGCGTACCCT TCCTAGTCTT GGCTCTCGCG GTAGGCTTTG TTCTAGGACG CAGTCTAAAC AGCATGGTAA TAGCACTGAT CATAGTATGG TGGCCCATTT ACGCTAGATA TTCCAGGAGT CTTACTCTGA GCCTCAGGGA GAGCATGTTC ATTGAGGCTG CTAAGGCGTC AGGTGCCAGT AACGCTAGAA TTATGTTCAG GCATATCCTT CCCAACACCT TACCACCAAT CCTAGTACAG ATCTCGTTAG ATCTAGGATC AGTTGTGGGC ATATTCGCCA CTCTCGCGTT TATAGGGTTC ATCCCAAACG CAAACATACC TGAACTTGGA TACCTGACAA GTTTAGGCCT AAACTACATA CAATCTGCTC CCTGGACTGT GATATTTCCA GGATTAGCCA TAACCTTATT CGCTCTCTCT GTGAATCTAA TGGGAGATGG TCTTAGAGAC GTCATAGATC CCAGGAGGAG AAGCTGA
|
Protein sequence | MEKQIQEEEE KRFYNLKLSL KVFFSNRTAV AGLIIFLGYV ADALLMQFYP EIVGVRNPNT LIYNFINPVP QPPSAKYPLG TTYPGVNLLQ AIMEAIRIDL GFSLLIVVSG ALIGAVIGVL AAYVGGYLDE VLMRITDIFF SVPFLVLALA VGFVLGRSLN SMVIALIIVW WPIYARYSRS LTLSLRESMF IEAAKASGAS NARIMFRHIL PNTLPPILVQ ISLDLGSVVG IFATLAFIGF IPNANIPELG YLTSLGLNYI QSAPWTVIFP GLAITLFALS VNLMGDGLRD VIDPRRRS
|
| |