Gene Msed_1503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1503 
Symbol 
ID5104032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1467454 
End bp1469019 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content48% 
IMG OID640507391 
Productamino acid permease-associated region 
Protein accessionYP_001191584 
Protein GI146304268 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGTA AAAGAAACGT CTTTATAAGG GAAAGTTCTG GTCTCCTTAA GCAGGTCAAT 
CTTCTTGACG CTGTGATGCT CAACATTGGT AATATGTCAG CTGGCGTCAC GCTATTTGAG
TCGATCTCCC CTTACATAAA CAACTACCCT GGTGGAGTCC TATGGCTCGC GTCAATAATA
GGCCTGGTTT TTGCGTTTCC ACAGCTCCTG GTTTATACCT ACATGACACG AAAGATGGGA
AGGACTGGCG GTGATTATGT CTGGATAAGC AGGAACTTGA ATGGGGCTAT TGGATCCACG
ATGGCGATAG CGCTCATGCT TGAGTCGGTT GCCTACTTTG CATTAGTGGC CTTCTTCTCG
GCGTCCAGCA TTAACGCAGT GTTATATACC ATTGGCAGTG TGGACAACTC CCCAAGCTTA
GTGTCCCTCT CAAACAACGT GTTTGTTAAC CCCTATAACG GCGGTCTCAC CTTTGAGCAG
AAGGCCCTAT TCTACGGAAT AGCTGCGGCG TTCTTCGTGA TCGTCATCCT GCTGAACATT
TTCAGATCCA GATGGGGTTA CTCCATTGTG ACAGGTTTCG GTATAGTATC GCTTTCAACC
CTTGTCATAG CGATGATCGT GATAGGAGCC TCGGCTGGGA GATTTGGAAC AGCCATAACC
CCGTTCCTTA ACTCTATCAA TTCAAGCTTA GTTAACGTTT ATCAATCCTC ACCACACACG
GCCTTCCCCA CGAACTTTAG CATAGTTTCA ACGGTGCTAC TATTACCGCT TTTCGCCCTA
TACACATACC CGTGGATGCA AGCTGGACCT GCGGTATCGG CTGAGTTTAA GCAAAGTGAT
AGGGTCGCCA AGTTCAACCT AGTGTTTGCC CTTCTTCTCA CAGCTATTCT TGTTACGGGA
GGCTTCCTGG AGATGGATCT GGTTGCGGGA TATCCCTTCA ACTTTGTTGC CTATCCCTAT
TTCATTTACA ATTTCTGGAC TGTTGCCATT GCACTGGCAG GAAATCCAGC CCTTCAATGG
CTCATTGGCA TAGGTGCCAT AGCCTGGAAC TTCTTCGTTT TAGCGTATGG TATAATAGTG
TTCTCCAGGT ACGTGTTCGC GCTCTCCTTT GACAGGATTC TTCCGGAGAA GTTCGCGGAG
GTAAACAGGT TCGGTTCACC CGTTTACGCC CATGCCCTCG ATTTAACCAT AACCCTACTA
TTCCTCCTGG TTCCAGTGTT CTCACTCAAT GCTGCCCTCT CGCTTTATGG AGCAACTATC
CTTGGCTCAA TCTATTTCCT AGTGGCCAGC ACAGCAGGTG CAATTTATGG TCTAAGAAAC
AGGGCCAAGG CGATATCCGT GGCTGGTGTA ATCTCGGCCC TCTACTTTGC CTTCCTTACA
TATGAGGCTG CCACTAACCC ACTGTTTGGC TTTACCACAT CAACAGGCTC GGTCAACTTG
ACCACATTGA TATTCGTGGT AGGGGTACTC GTAGTTGGCT TCCTGGTTTA CCTGGTATCT
AACTACAGAA ACAAGAAGAA GGGAATAGAT ATTTCTCTAG TGTTCAAGGA AATTCCTCCA
GAGTAG
 
Protein sequence
MSSKRNVFIR ESSGLLKQVN LLDAVMLNIG NMSAGVTLFE SISPYINNYP GGVLWLASII 
GLVFAFPQLL VYTYMTRKMG RTGGDYVWIS RNLNGAIGST MAIALMLESV AYFALVAFFS
ASSINAVLYT IGSVDNSPSL VSLSNNVFVN PYNGGLTFEQ KALFYGIAAA FFVIVILLNI
FRSRWGYSIV TGFGIVSLST LVIAMIVIGA SAGRFGTAIT PFLNSINSSL VNVYQSSPHT
AFPTNFSIVS TVLLLPLFAL YTYPWMQAGP AVSAEFKQSD RVAKFNLVFA LLLTAILVTG
GFLEMDLVAG YPFNFVAYPY FIYNFWTVAI ALAGNPALQW LIGIGAIAWN FFVLAYGIIV
FSRYVFALSF DRILPEKFAE VNRFGSPVYA HALDLTITLL FLLVPVFSLN AALSLYGATI
LGSIYFLVAS TAGAIYGLRN RAKAISVAGV ISALYFAFLT YEAATNPLFG FTTSTGSVNL
TTLIFVVGVL VVGFLVYLVS NYRNKKKGID ISLVFKEIPP E