Gene Information |
Locus tag | Msed_1121 |
Symbol | |
ID | 5104154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1050494 |
End bp | 1051966 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640507014 |
Product | amino acid permease-associated region |
Protein accession | YP_001191207 |
Protein GI | 146303891 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0833] Amino acid transporters |
TIGRFAM ID | |
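The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that Protein Length excludes the stop codon. The record does not state either convention, and the helper names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1050494
END_BP = 1051966


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1473, matching "Gene Length"
print(protein_length(length))  # 490, matching "Protein Length"
```

Because the gene lies on the minus strand, the stored gene sequence would be the reverse complement of the replicon between these coordinates. The lengths are unaffected by strand.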
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0304101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCAGACA AACCTTTCAA ATTGGCGAAG GTAATAGGAC CCGTGGCCAT AATAGCCTCG GCAGTAAGTC AAGAATACGG AGCTGGTATC AACGCGGTAG CCACACAGAG TATTGGATCA TATCCTGCCA TACTCAACCT TGTTCCAGCA ATCATGTTCA TAACTGGATT GCTCATGCTT CCCAAGGTAT TCATGTATCA GAAGTTTGGC AAGGTGGCAA GCAGAAGCGG AGGACAATAC GTCTGGATAT CTAGAACTAC CACTCCGGAG GTGGGATTTA TTGTTCACTT TCTATACTGG ATTGGAATAG TATCTGCCAT AGGGTTCATT AGCTACACTG TTGGATCTAC CCTAGCTTCA ACACTCGTGT CATTGGGGAT ATCCTCTGGG GCGTGGTTCG CTACATTTAC AGGGCATATT GTGCTGGGGT TGGCCCTAAT ATGGTCCTTC TTCTTAATTC ATTACACGGG CGTGAGAAGC TATGGAGTTG TGGTAACTCT GCTGTTTGCC CTCGTTTTGC TAGGGGCAAT CATATCAATG GTTGCGGGTT TCGGCACCGC TAATTCTGTC TACACTGGTT ATTTATCGAG TCAAATATTT CATGGAACGA TTCCCAGTTA CACTACACCT CCCCTAACTT ACTCGGATAT TTTCGGTACA GTAACCCTGT TCATTTTTGC GTATGCGGGC ATAAGCGCGG CCCCTCTCCT AGGTGGAGAG GCTAAGGATC CCAAAAAGGA CATGCCAAGG GGTATATTCC TAGCGTGGTT GATTGCGTTA GTCCTGTTTA CCTTAGTTTC GCTTGCAGTC TTCCACGCAA TAACTGGAGG GCAAGTGTTT GCGTTAATAA AATCAAAGTA TTCCTATTAC GCTACCATTC CTGGCATACT GAGCATATCT GAACCGAAAC TTATCGGAGC TATATTCTCA ATCATAGTTA CAATTATTAT AATGAAGACA ATCATGCCCC AGTTACTTAC CTCCAGTAGA ACGCTCTTTG CCTGGGGCCA AGACAAGATA CTTCCTGAGG TCTTCACTCA CACTAACAAG TTTAAGGCAC CCGACTTCTC CCTGCTGGTA TGCGCGCTAT TTGCATCAAT ATACCTAGTT TATACAACTA GCGTGGGTGT GTCCGCTGTG GACGTAAGAT CCCTCTCTGT CCTACTTGAG ATGATGGCTC TCGGGGCAGG GGTACTTCTT ATCTCGACCA AGAGTAGCAA GAAAGAATGG GAAAAGGAAG TGACGACAAT AGGTGCGATC ATAGCAGGGT TAGCAGGTAT AATAGTCACG CTTATTATTA TTCCAAGCGT CGCCGTTGTA CCCCACGTTT CAATTCTCCT TCAACCCTCG TTTCAAGTGA TATTGGTTAT AGTGATAGGT TTCCTCATCT ATGAAATCGC AAAAATGTAT AACAAACGGA CTAAAAACAT CGATCTAAAT GATCTAATAA AGAAAGAGCT ACCCCTGGAA TGA
|
Protein sequence | MSDKPFKLAK VIGPVAIIAS AVSQEYGAGI NAVATQSIGS YPAILNLVPA IMFITGLLML PKVFMYQKFG KVASRSGGQY VWISRTTTPE VGFIVHFLYW IGIVSAIGFI SYTVGSTLAS TLVSLGISSG AWFATFTGHI VLGLALIWSF FLIHYTGVRS YGVVVTLLFA LVLLGAIISM VAGFGTANSV YTGYLSSQIF HGTIPSYTTP PLTYSDIFGT VTLFIFAYAG ISAAPLLGGE AKDPKKDMPR GIFLAWLIAL VLFTLVSLAV FHAITGGQVF ALIKSKYSYY ATIPGILSIS EPKLIGAIFS IIVTIIIMKT IMPQLLTSSR TLFAWGQDKI LPEVFTHTNK FKAPDFSLLV CALFASIYLV YTTSVGVSAV DVRSLSVLLE MMALGAGVLL ISTKSSKKEW EKEVTTIGAI IAGLAGIIVT LIIIPSVAVV PHVSILLQPS FQVILVIVIG FLIYEIAKMY NKRTKNIDLN DLIKKELPLE
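The gene sequence begins with GTG while the protein begins with M, which fits the record's translation table 11, where GTG can serve as an initiation codon. The sketch below shows one way the space-grouped sequence could be cleaned, its GC fraction computed, and the CDS translated. It is a minimal pure-Python illustration, not the database's own pipeline. The function names are hypothetical, and the start-codon set is my understanding of NCBI table 11. Only the first 30 bases of the record are used in the example.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses these same assignments and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed table 11 initiation codons; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in tens and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
prefix = "GTGTCAGACA AACCTTTCAA ATTGGCGAAG"
print(translate_cds(prefix))               # MSDKPFKLAK
print(round(100 * gc_content(prefix), 1))  # 43.3
```

Passing the full gene sequence to `translate_cds` and `gc_content` would allow a comparison against the Protein sequence and GC content fields of the record.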