Gene Information |
Locus tag | Msed_0391 |
Symbol | |
ID | 5103634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 338981 |
End bp | 340570 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506297 |
Product | amino acid transporter-like protein |
Protein accession | YP_001190492 |
Protein GI | 146303176 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
|
|
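The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information table above.
start_bp = 338981
end_bp = 340570
gene_length_bp = 1590
protein_length_aa = 529

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span // 3
expected_protein = codons - 1

print("span (bp):", span)
print("span is a whole number of codons:", span % 3 == 0)
print("expected protein length (aa):", expected_protein)
```

Under these assumptions the span matches the listed gene length and the codon count minus the stop matches the listed protein length.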
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.445572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGATA AAAAGACCCT TTTCCTGAGG GAGTCCTCCG GTCTCGTGAG GGAGGTCAGT CCTTGGTCTT CTATGTTTGC CACTTTCGGA CTGGTGACCG GGGGAGTTCC AATTTTAATC CTAACCTGGC TTTTTACAGC GCCTGGGGCC AACTGGCCCA TGGCGTTTCT CCTGACTCTG CCCCCCACCC TTGGTATGGC CTACCTCTTT TACCTAGCGG GGGTAGCCAT GCCCAGAGCA GGTGGGGACT ACGTATTCAA CAGCAGGGCG GTTCATCCGC TCGTGGGCTT CGTGAATTAC TTCGGGCTAT TCATTGGGTT CGGTCTCTCA CTGGGCTACT ACAGTTATCT GGGGGCTCAG TGGTTTGGTT ACCTGTTTTC CGGGTTAGGG CTCGCCTACA ATAACCAGGA ATTCCTAAAC CTAGGAAACT GGTTCTCTGG AACCCAGGGT AGCATCGTGG TGGGATTGAT CATTGTGATA ATCTCGGCTA TCCTAGCCTC AGTCCCTAGG GCCCAGTGGA GATTCGTCAC GGGAGCAGGG ATTATCACGT TCCTCTCTAC GATAATCATG TTCGTAGCTC TGGCTCAGAT AAATCCATCT TCCTTTGCGC ATGCTCTCTC AGCAACGACA GGTATCCCCA ACGCGTACAA CCAGGTGATA AGCGACGCCA CCAGCAATGG GCTCAAGTTC GAGTCACCCC TCTACGCAAC CTTCCTAGCA GCTCCCGTTA TATGGTACTA CTACACGTGG TATAACCTGC CCCTATCCTG GGCTGGGGAG ATGAAGCAGG TAAGAAAGAA CGTACTTTAC GCTACCGTGG TGGCCATACT CATCATTGCC GTTTACTACA CCACGTTCAC TTTCCTCAAC CTTCACGCCT TCGGTTCCAA CTTCCTAACA TCTTGGAGTT ACATCAGTAA CCAAGGAGTT AACGATACGG TGTACTCGAA CCTCCAGAGC ATAGGTGACT TCACCCCGTT CTTCGCCTTC ATTGTAACCC ATAGCCTACC GCTCTTCCTT ATCATGTTCT TGGCCCTATG GTTGCCCAAC TTCTACAGTA ATCCTCCCCT TGTCACAGGC CTTGTGAGGT ATCTCTTCTC CTGGTCCTTC GACAGGATAA TGCCAGAGTG GATGGCTGAC GTTAACGAGA CGTTGAGGGC ACCAGTGAAG GCGACGCTCC TGGTGGGGGC AATGGGCGTC ATAGGTCTGT TCCTTTACGC TTATAATACT CCAGTATCTC TCGTGGATGT CACCGTGGTG TTTGAAATAG GCTACGGTGT GTTCGCCCTT TCCACGGCAT TAATGCCATA TGTTAGGAAG AACGTGTACG AGGGAACCTT CCCCAATAAG ACCAAGGTGG CTGGAATACC CCTTGTGACC ATCGTGGGTA GCCTGGTTTT CGCCTTCACC CTGTTCCTCC TGGCCTACAC ATGGAATAAC CCGGTCCTAT TACCCATCAA CTTAGAGACA ATACTCTCGC TTGTTATAAT ATATGGTCTA GGGATAGCCA TCTACATGAT ATCCTCCCTC AGGGCGAAGA AGAAGGGTCT GGACATAAAC ATATTATTCC AGGAAATACC GCCGGAGTGA
|
Protein sequence | MEDKKTLFLR ESSGLVREVS PWSSMFATFG LVTGGVPILI LTWLFTAPGA NWPMAFLLTL PPTLGMAYLF YLAGVAMPRA GGDYVFNSRA VHPLVGFVNY FGLFIGFGLS LGYYSYLGAQ WFGYLFSGLG LAYNNQEFLN LGNWFSGTQG SIVVGLIIVI ISAILASVPR AQWRFVTGAG IITFLSTIIM FVALAQINPS SFAHALSATT GIPNAYNQVI SDATSNGLKF ESPLYATFLA APVIWYYYTW YNLPLSWAGE MKQVRKNVLY ATVVAILIIA VYYTTFTFLN LHAFGSNFLT SWSYISNQGV NDTVYSNLQS IGDFTPFFAF IVTHSLPLFL IMFLALWLPN FYSNPPLVTG LVRYLFSWSF DRIMPEWMAD VNETLRAPVK ATLLVGAMGV IGLFLYAYNT PVSLVDVTVV FEIGYGVFAL STALMPYVRK NVYEGTFPNK TKVAGIPLVT IVGSLVFAFT LFLLAYTWNN PVLLPINLET ILSLVIIYGL GIAIYMISSL RAKKKGLDIN ILFQEIPPE
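The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It is an illustration, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so a single table is used here. Only the first 30 bases of the gene are included as the example input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-character grouping and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
fragment = "ATGGAAGATA AAAAGACCCT TTTCCTGAGG"
dna = clean(fragment)
print(translate(dna))  # MEDKKTLFLR
```

Running the same helpers on the full gene sequence would allow a comparison against the listed protein sequence and the listed GC content of 51%.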