Gene Msed_1447 details


Gene Information

Locus tag: Msed_1447
Symbol:
ID: 5104817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1414270
End bp: 1415532
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 48%
IMG OID: 640507335
Product: amino acid permease-associated region
Protein accession: YP_001191528
Protein GI: 146304212
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
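
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


print(gene_length(1414270, 1415532))  # 1263
print(protein_length(1263))           # 420
```

Both values agree with the Gene Length and Protein Length fields of the record.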


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAGTA AACCAAAAAT CTCACCCACT GAGGTGTTCT TCCTGTCCTT TGGCGGACAA 
TCGCCCTTCA TTTCACTCAT GGCGTTCGGC ACAGTGATGA TTTCCTACGT TGGTATCCAT
TCTGGGTTCG CGATGATAGT TACTACCCTC GTGGTAATGG CTAACGCGTC AGTGGTTTAT
TCACTTTCAA AGAGATTCAA CAAAGGAGGT GGGTATTACA CCTATGCCCT ACACACGCTG
ACCAATAACT TGGGCATAAC TACGGGCTGG ATGTATATTC TTTACTCGTT AAGCTACGGT
GGTACCTTGA TGATGGGTGG GGTCTATGTA CTTAACCTGT TAACAGGTAT TAGTCCCCTA
TACCTTACCC TTATAGTTTC AATTCTTGCC TCCACGATAG TTATTGCTGG AGTGAAGCTT
TCAGCCAAGT ACGCAGTGGC CGTGGGCATA TTGGAGATAA TAGCAATCCT AGGTCTCTCG
ATTTTCTTCA TGTATAGATC TGGGTTTGCG TTCTATAATC CTATTCCCAC TTCTCTTCCA
ATGAACCTGC CAGAGGCAAT ACTTTTCGGT ATTGGAATAC CATCGGGCTA CTCCAGCATA
GTCAGCTACC CCGAGGAGAT TGAGAACGCT TCCAAGACAG TGAGCAGAAT TTCCCTCTTA
GTTCCAGTCA TAGGAGGTGG ATTGGCATCC TTCTTCTTCT ACGCTTTAGC GGCCCTAGGT
TTCACGGGTA ATCTAGTTGA GTTGCTCACC TCAGAGTTCG GACTTGTAGG GGGTATCCTG
ATATCCGCCA TAGCCCTGAG TGATGCTGTG CTGGGAGGAA TAGCGTACCT GTTGGCTGGG
TCAAGGACTC TCTACAACAT GTCCAAAAAT GGCCATCTAA TCAGTTATCT CGCGAGGGAG
TATAAGGGTC AGCCCAAGGT GGCCGAGGTA CTAATCTCGG TGTTGGTGAT ACTCTCACTC
TCTTTTCTCT CAATGAACTT CAGTCCTCTG GTGGCGCTAG GCCTGATTGG AGGGGTATCA
GGAATGAGTA ACCTTTACAT CCATATGGCG GCTGGGGTCT CTCTCGCCAG AATGGGAAGG
AAAAAGCCCC TGAAGCATCT CCACGAAATA GCCTTCTCCG TTGTTTCCCT AGCTTTCTCG
GCCTGGGTCC TGCTCATTTC ACTGGTTCAG CTAGAGAAGT ACGTGGTTTA CTTCTTCTTG
GGTTGGATAA TTCTAGGTTT TCTCCTAGCT GAGAGCCTTG AAATGGTTAA GGAGGAAGAG
TAA
 
Protein sequence
MGSKPKISPT EVFFLSFGGQ SPFISLMAFG TVMISYVGIH SGFAMIVTTL VVMANASVVY 
SLSKRFNKGG GYYTYALHTL TNNLGITTGW MYILYSLSYG GTLMMGGVYV LNLLTGISPL
YLTLIVSILA STIVIAGVKL SAKYAVAVGI LEIIAILGLS IFFMYRSGFA FYNPIPTSLP
MNLPEAILFG IGIPSGYSSI VSYPEEIENA SKTVSRISLL VPVIGGGLAS FFFYALAALG
FTGNLVELLT SEFGLVGGIL ISAIALSDAV LGGIAYLLAG SRTLYNMSKN GHLISYLARE
YKGQPKVAEV LISVLVILSL SFLSMNFSPL VALGLIGGVS GMSNLYIHMA AGVSLARMGR
KKPLKHLHEI AFSVVSLAFS AWVLLISLVQ LEKYVVYFFL GWIILGFLLA ESLEMVKEEE
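
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that in plain Python, under the assumption that translation table 11 assigns the same amino acids to codons as the standard code (it differs only in which codons may act as starts). The names `translate`, `gc_content`, and `clean` are illustrative helpers, and the input is assumed to contain only A, C, G, T and whitespace.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = "ATGGGCAGTA AACCAAAAAT CTCACCCACT GAGGTGTTCT TCCTGTCCTT TGGCGGACAA"
print(translate(first_line))  # MGSKPKISPTEVFFLSFGGQ
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` gives values to compare against the Protein Length and GC content fields.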