Gene Msed_1819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1819 
Symbol 
ID5105382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1765899 
End bp1767323 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content51% 
IMG OID640507718 
Productamino acid permease-associated region 
Protein accessionYP_001191897 
Protein GI146304581 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.757233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAGA AAAAGAAACT GGAAAATGAG GCTCAACCAA GCCTGAAGAA GGACGTGTTG 
GGGACGTGGT TAGTGGCAAG TTACGGTATA GCTGCAAACG CTCCCATTGC AGTTGCCACG
CTGTATTTCG TGGGGATTGC TGGTATAGCT GGAGGTGCCA TGCCCCTAGT GGTTCTGCTT
TCCTACCTGA TTTACGCTAC CACACTGATC GTGATATACG AGTGGAGCAA GGACGTGGCT
TCATCTTACG GCTACGTGGC CATCATGAAG AAGGGTCTTA ACAGCAGTTT AGCTGCTTTC
ACCGTGGGAT ACGGTTACAT TTATCAGTAC CTGGTCGCAG GGACTGCCGG TTTCGGCATA
CTCGGCCTAG CTTCCTTCCT TTACTTAATA TCGCCAAGTA TAGCCTCAAC GATGCCGTGG
CTGTGGGCCC TGATCACGGT GATCCTGACG CTCGAGGTTA CCCTTGTAAT GTGGTTGGGC
GTGAAGCCCG GTGGTCTGCT CAACCTCGTG ATAGGTCTAT TTTCCATTGG ATTTCTCGTG
GTAACGTCTA TCTCTCTCAT AGCCGTTGCG GGAAGTCACA ACACCGTAAG CGTGTTCACC
GCATCCCCGG TGAACAACAA CTGGGTTCTA ATACTGGTTT CAATGATCTT TGCGATCACG
ACGTTTGGAG GAGCCACGAC TCCCATAGGC GTTGCCGAAG AGGCCAAGGT ACCAAAGAGA
ACCATGCCCA GGGCACTCCT CCTCGGGTTT GGACTACTTG GAGTGGGGCT AATTCTCAAC
TCCTATGCGC AGACCGTGAT CTACGGAGTG TCCAACATGT TCAACTACGC CTCTCTCCCT
GATCCCATGG TGATCATCTA CAGCAAGTAT TTCAGTCCCG TGATTGTGGA TCTACTAATA
GTACTGGTTG CATTCATGTT CAACTCCTCT ATCATTTCCT TTGCGACCAG CGGTAGCAGA
ATGATATACG GGATGGCAAG GGACGGAATA CTCTATCCAA GCAACTTCTC GAAGGTGAAC
AGGCACGGGG CTCCCGGTAA CGCAATAATA TTGACTGGAG TTATCGCTGG GGCACTTTGC
CTGCTAACCG GTTACCTTCT AGGTCCCCTG GAGGCCAGCA TCTTCCTAAT AACCTTCGGC
TCATTTTACG TTTCTCTCGG GCACCTGTTT GCTGCCTTGG CTCTCATTAG AAGAAAGGTG
AAGCTGGGGA GGCCAGACAT CGCCAAGCAC GTGTTAATCC CCATAATCTC CATGGGCGCT
TACGTGGCTA CGATATACTT CGGAACCTAT CCGGCACCAG CGTTTCCCTT GAACATAGCA
GTGTATTCGG CCTGGGCCGT GTTGGCGGTT CACGTCGTAG TGTATTATTT GATGAAGAGA
AGATATCCGG AAAGGCTAAG CAAGTTCGGG GATCACAGTC TATAG
 
Protein sequence
MDEKKKLENE AQPSLKKDVL GTWLVASYGI AANAPIAVAT LYFVGIAGIA GGAMPLVVLL 
SYLIYATTLI VIYEWSKDVA SSYGYVAIMK KGLNSSLAAF TVGYGYIYQY LVAGTAGFGI
LGLASFLYLI SPSIASTMPW LWALITVILT LEVTLVMWLG VKPGGLLNLV IGLFSIGFLV
VTSISLIAVA GSHNTVSVFT ASPVNNNWVL ILVSMIFAIT TFGGATTPIG VAEEAKVPKR
TMPRALLLGF GLLGVGLILN SYAQTVIYGV SNMFNYASLP DPMVIIYSKY FSPVIVDLLI
VLVAFMFNSS IISFATSGSR MIYGMARDGI LYPSNFSKVN RHGAPGNAII LTGVIAGALC
LLTGYLLGPL EASIFLITFG SFYVSLGHLF AALALIRRKV KLGRPDIAKH VLIPIISMGA
YVATIYFGTY PAPAFPLNIA VYSAWAVLAV HVVVYYLMKR RYPERLSKFG DHSL