Gene Information |
Locus tag | Msed_1455 |
Symbol | |
ID | 5104825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1422222 |
End bp | 1424129 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640507343 |
Product | amino acid permease-associated region |
Protein accession | YP_001191536 |
Protein GI | 146304220 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
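The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only a sanity check, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 1422222
END_BP = 1424129
GENE_LENGTH_BP = 1908
PROTEIN_LENGTH_AA = 635


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1908 bp) and protein length (635 aa).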
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGAG AAAAGTCTCA AAAAGCAGGG GATTTTGGGT TAGAATCCGA TAAGCAACTA AGAAGATCTC TAGGAAAGTT CGAGTTGCTA TATCTTTCAC TGGGAGGAAT CATAGGATCA GGATGGTTAT TCGGAGCCCT ATATACCGCG GAAGATGCTG GTGGATCTGC TATATTGTCT TGGATAATCG CAGGAGTACT CGTTCTTTTC GTCGGTCTAG CTTACTCTGA ATTAGGTTCT GCCATACCAA AGTCCGGCGG TATAGTGAGG TATCCGCATT ATTCTCATGG AGGTGTAGCT GGATATATTA TCACGTGGAC CTATTTTCTT TCCGCGGCAT CGGTTCCTGC CATAGAGGCC ACAGCCACAG TAACCTATCT ATCTAGCTTG GTTCCAGCAC TTACAGTAAA CGGAGTGCTA ACTCCACTGG GCATACTGAC AGCATATCTA TTCCTTCTAT TCTTCTTCTT CCTTAATTAC ATCGGTGTAA ATATACTAGG AAAAGTTACA CACGGTGCAG GTTGGTGGAA GTTACTGATT CCCTCAATAA CCGTAGTAAT TCTTCTTATA TTTTATTTCC ATCCAGCTAA CTTCACTCTT GGCGGAGGAT TTTTCCCTTC TGCTTCTAAT GTTGCTGCTG GTTCATCAGG GATCTATGGA TTTTCTGCAG TACTTTACGC AATTCCGACT ACCGGTGTAA TATTTTCCTA CCTAGGTTTT AGACAGGCAG TAGAATATGG TGGTGAAGGT AGAAATCCAA AGAAAGATAT TCCCTTTGCG GTCATGGGCT CCCTAGTAAT TGCGCTTATT CTATACACAC TTTTACAGGT AGCATTTATA GGAGCAATAA ACTGGAACGC GCTCACTGTT ACAAGGGGAA ATACAACAGT ACCTGTTACG CCAGGCAACT GGACAGAGTT AGGTCAAACT GCCATTTCAT CGGGACCATT CTATCAGATT TTCAAACTAG CGGCTCCTCT AGGTCTATTA TCATTAATTT TCAGTGGATG GGCATACATA CTGCTTCTAG ATGCTGTGAT TTCTCCGAGC GGGACAGGGT GGATCTATAC TGGTACCAGT ACTAGGACAA TGTATGGCTT CGCTACAAAC GGCTACTTGC CTGGTATCTT TCTAAAGGTG GGTAAAACTA GGATTCCAAT TTACTCTTTG ATAGCCGCGA CTATCATTGC CGCAATATTC ATGTTACCCT TCCCATCTTG GCAATCGCTG GTTGGTTTCA TAAGTTCGGC CACAGTCTTT ACCTACATAA TGGGAGGGAT AGGGCTTGAG ACTCTCAGGA AGACCGCCCC AGAACTCAAT AGGCCGTACA AGTTGCCCTT AGCAAGGGTT ATAGCGCCAA TTGCTACACT TGCGGCGGGC CTGATAGTGT ATTGGTCAGG TTTTGCCACC CTGTTCTACG TTATCACTGG GATATTCTTG GGATTTGCCT TATTTTTTGG CTACTATGCC TTCAAGGTTA TGGGAATTAA TAAGGCGTTT TCCGCTATCG TAGGATTGGT AAACATAGTG GTGACCCTAG TGTTAGCCTT TGAATTCTAC GGTGCCACCT CCGGTCTAAC TGCAGCGAAC AATGTGGCGT TCTTGATCTA TATCCTAGTC ATGGCAGGCC TAGTAGCGTT TGATGTGGGA GTGCTTCATG CATTTGGCAA GGGTGAAGAT GTGAAAAGGG AGATAACTGC TAGCTACTGG TTGCTAGCCT ACATTTTCGT AGTAGCCATC ATTTCATACT TTGGAGGTTT CGGACTAAAT CCGGTGATTC CATTCCCCGA GGACACCATT 
GTGGCTGCAG TGGTTACTCT AGCAGCCCAC TATGGGGCAG TGAAAAGCGG ATTTAGAACT CAGGCCATAC AAGATATCCT AGAGGAAACA AGGGAGACCC CACCCTAA
|
Protein sequence | MSGEKSQKAG DFGLESDKQL RRSLGKFELL YLSLGGIIGS GWLFGALYTA EDAGGSAILS WIIAGVLVLF VGLAYSELGS AIPKSGGIVR YPHYSHGGVA GYIITWTYFL SAASVPAIEA TATVTYLSSL VPALTVNGVL TPLGILTAYL FLLFFFFLNY IGVNILGKVT HGAGWWKLLI PSITVVILLI FYFHPANFTL GGGFFPSASN VAAGSSGIYG FSAVLYAIPT TGVIFSYLGF RQAVEYGGEG RNPKKDIPFA VMGSLVIALI LYTLLQVAFI GAINWNALTV TRGNTTVPVT PGNWTELGQT AISSGPFYQI FKLAAPLGLL SLIFSGWAYI LLLDAVISPS GTGWIYTGTS TRTMYGFATN GYLPGIFLKV GKTRIPIYSL IAATIIAAIF MLPFPSWQSL VGFISSATVF TYIMGGIGLE TLRKTAPELN RPYKLPLARV IAPIATLAAG LIVYWSGFAT LFYVITGIFL GFALFFGYYA FKVMGINKAF SAIVGLVNIV VTLVLAFEFY GATSGLTAAN NVAFLIYILV MAGLVAFDVG VLHAFGKGED VKREITASYW LLAYIFVVAI ISYFGGFGLN PVIPFPEDTI VAAVVTLAAH YGAVKSGFRT QAIQDILEET RETPP
|
| |
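The sequences above are printed in blocks of ten characters separated by spaces, so the whitespace has to be removed before any computation. The listed gene sequence begins with ATG and ends with TAA. The sketch below shows one way to clean a block-formatted sequence, compute its GC percentage, and check for a start and stop codon. The helper names are hypothetical, and the start-codon check is a simplification: it accepts only ATG, although translation table 11 also permits alternative start codons.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def has_start_and_stop(raw: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and it ends in a stop.

    Simplification: alternative start codons (e.g. GTG, TTG) are not accepted here.
    """
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# First two blocks of the gene sequence above, used only as a short excerpt.
excerpt = "ATGTCAGGAG AAAAGTCTCA"
print(gc_percent(excerpt))
```

To compare against the listed GC content of 45%, pass the full gene sequence (both lines joined) to `gc_percent`.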