Gene Information |
Locus tag | Msed_1312 |
Symbol | |
ID | 5104563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1289326 |
End bp | 1290678 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507201 |
Product | amino acid permease-associated region |
Protein accession | YP_001191394 |
Protein GI | 146304078 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
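
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1289326, 1290678)
print(length)                     # 1353
print(protein_length_aa(length))  # 450
```

Both results agree with the Gene Length and Protein Length rows of the record.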
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.3758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGA AAGAACTTGC AAAGGGATCT GTTTCATATA GAGAAGTAGT GGCTCAAGGA GTGGGTGGAG CGGCCCCGGC AATGGCCAGC CTAGTGACGC TAACAGGGGC TGCAGCCTAC GCCTATGGTT CTTTCCCGCT AGCTGTACTT TTAGCAACGG TAGCGGTACT CCTTGATGCT ACTAGGTTAT CCATTACAAG TAGATACGTT CAAAGTGCTA GAGGGATTTA CGCATTCATC TCAGAAGGTT TGGGAAAGAA AGTTGGTTAT TTTGTGGGTT GGGCTTACGT TCTCTACGCT CTTACGGCTC TAGTTTTTAT CTACCTCTCC ATAGGAGTAT TTCTGCCAGG TGCATTGCAG GTACTTGGGA TTAATACTCC GGGATGGATA TGGGCTCCCC TAGTCGTGGC CGTCGCGCTA TTTGGAGGAA TCCTCTCCTA CCTTGGAATA AGACCATCCC TCAAATTTAC GCTCACCATG AGCATCCTTG AAATAGCGTT CATTCTGGGA ACTTCCCTCT TAATCTTCAC TAAGGTGTCG CCAGATCCTG CCACCTTTAC TCTCAAGTAC GCCCCACCGC CATCCGCATT CAACGTCGGA GTAGGCATGG CTTTCGTGTT CCTGGCCTTC GCGGGATATG AGACTACCTC AGTCCTAGGA GAGGAGGCTG TGGACCCTAA GAACACCATA ACTAAGGGTG TCTTCACCAG TGCCTTGCTC GTGGGGATTA CCTACCTTAT GGCCAGCGAA GCCTTCACTG TGGGTTGGGG GGTCAACGAC ATGTCATCCT TCTTTAGCCA ACTTGTTCCA GGCATCGTCC TGGGAATGAG ATATGGAGGT TTCGTCCTGG CAGTTATCCT AACGATTCTG CTCATAAACA GCGGACTAAC AGACTCTGTA ACTTTCTTCA ACACAGTATC CAGGGTGGTC TACGCCATGG CTAGGGACGG CGTCCTAGAT AAGAGATTGG AGGGAATACA TGATAACAAC AGAACTCCCC ACGTAGCCAT CCTCTTCTCC CTTGCCTTCT CTCTTCTATA CACTCTCATC TTCTCAGCAG CGATAGGGCC AGCTAACGTT TTCTTATCAG TTGGTATCAC CACAACGTTC GGTTTCCTGA TTGCCATATT TACTGCAAAC ATTAGTCTAT TATTCATCTT AAGAAGGTTT AGCGCACTTA ACGTGTGGAA CGTTCTTCTC ACGGTGATCA TAAATGCGAT TCTAGGATTC GTAATATTTG CCAACATAGT TACAACTGCA GTCAATTCCT TCGTTCTCAT TGGAGTTGCT ACATTCGCCG GCTGGATGAT AATCGGGGCA ATTTATTATT GGTTGAGAAA AGTAAGAGTA TAA
|
Protein sequence | MSKKELAKGS VSYREVVAQG VGGAAPAMAS LVTLTGAAAY AYGSFPLAVL LATVAVLLDA TRLSITSRYV QSARGIYAFI SEGLGKKVGY FVGWAYVLYA LTALVFIYLS IGVFLPGALQ VLGINTPGWI WAPLVVAVAL FGGILSYLGI RPSLKFTLTM SILEIAFILG TSLLIFTKVS PDPATFTLKY APPPSAFNVG VGMAFVFLAF AGYETTSVLG EEAVDPKNTI TKGVFTSALL VGITYLMASE AFTVGWGVND MSSFFSQLVP GIVLGMRYGG FVLAVILTIL LINSGLTDSV TFFNTVSRVV YAMARDGVLD KRLEGIHDNN RTPHVAILFS LAFSLLYTLI FSAAIGPANV FLSVGITTTF GFLIAIFTAN ISLLFILRRF SALNVWNVLL TVIINAILGF VIFANIVTTA VNSFVLIGVA TFAGWMIIGA IYYWLRKVRV
|
| |
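
The sequence rows can be cross-checked against the GC content and protein fields with a short sketch. It assumes the space-separated 10-base blocks are purely a display convention, and it relies on translation table 11 using the same amino-acid assignments as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). All function names are hypothetical.

```python
from itertools import product

# Standard amino-acid assignments, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and last 15 bases of the gene sequence in this record.
head = "ATGAGCAAGA AAGAACTTGC"
tail = "AA AGTAAGAGTA TAA"
print(translate(head))  # MSKKEL
print(translate(tail))  # KVRV
```

The two fragments reproduce the start (MSKKEL) and end (KVRV) of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would check the GC content and protein rows in the same way.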