Gene Msed_1312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1312 
Symbol 
ID5104563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1289326 
End bp1290678 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content47% 
IMG OID640507201 
Productamino acid permease-associated region 
Protein accessionYP_001191394 
Protein GI146304078 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.3758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGA AAGAACTTGC AAAGGGATCT GTTTCATATA GAGAAGTAGT GGCTCAAGGA 
GTGGGTGGAG CGGCCCCGGC AATGGCCAGC CTAGTGACGC TAACAGGGGC TGCAGCCTAC
GCCTATGGTT CTTTCCCGCT AGCTGTACTT TTAGCAACGG TAGCGGTACT CCTTGATGCT
ACTAGGTTAT CCATTACAAG TAGATACGTT CAAAGTGCTA GAGGGATTTA CGCATTCATC
TCAGAAGGTT TGGGAAAGAA AGTTGGTTAT TTTGTGGGTT GGGCTTACGT TCTCTACGCT
CTTACGGCTC TAGTTTTTAT CTACCTCTCC ATAGGAGTAT TTCTGCCAGG TGCATTGCAG
GTACTTGGGA TTAATACTCC GGGATGGATA TGGGCTCCCC TAGTCGTGGC CGTCGCGCTA
TTTGGAGGAA TCCTCTCCTA CCTTGGAATA AGACCATCCC TCAAATTTAC GCTCACCATG
AGCATCCTTG AAATAGCGTT CATTCTGGGA ACTTCCCTCT TAATCTTCAC TAAGGTGTCG
CCAGATCCTG CCACCTTTAC TCTCAAGTAC GCCCCACCGC CATCCGCATT CAACGTCGGA
GTAGGCATGG CTTTCGTGTT CCTGGCCTTC GCGGGATATG AGACTACCTC AGTCCTAGGA
GAGGAGGCTG TGGACCCTAA GAACACCATA ACTAAGGGTG TCTTCACCAG TGCCTTGCTC
GTGGGGATTA CCTACCTTAT GGCCAGCGAA GCCTTCACTG TGGGTTGGGG GGTCAACGAC
ATGTCATCCT TCTTTAGCCA ACTTGTTCCA GGCATCGTCC TGGGAATGAG ATATGGAGGT
TTCGTCCTGG CAGTTATCCT AACGATTCTG CTCATAAACA GCGGACTAAC AGACTCTGTA
ACTTTCTTCA ACACAGTATC CAGGGTGGTC TACGCCATGG CTAGGGACGG CGTCCTAGAT
AAGAGATTGG AGGGAATACA TGATAACAAC AGAACTCCCC ACGTAGCCAT CCTCTTCTCC
CTTGCCTTCT CTCTTCTATA CACTCTCATC TTCTCAGCAG CGATAGGGCC AGCTAACGTT
TTCTTATCAG TTGGTATCAC CACAACGTTC GGTTTCCTGA TTGCCATATT TACTGCAAAC
ATTAGTCTAT TATTCATCTT AAGAAGGTTT AGCGCACTTA ACGTGTGGAA CGTTCTTCTC
ACGGTGATCA TAAATGCGAT TCTAGGATTC GTAATATTTG CCAACATAGT TACAACTGCA
GTCAATTCCT TCGTTCTCAT TGGAGTTGCT ACATTCGCCG GCTGGATGAT AATCGGGGCA
ATTTATTATT GGTTGAGAAA AGTAAGAGTA TAA
 
Protein sequence
MSKKELAKGS VSYREVVAQG VGGAAPAMAS LVTLTGAAAY AYGSFPLAVL LATVAVLLDA 
TRLSITSRYV QSARGIYAFI SEGLGKKVGY FVGWAYVLYA LTALVFIYLS IGVFLPGALQ
VLGINTPGWI WAPLVVAVAL FGGILSYLGI RPSLKFTLTM SILEIAFILG TSLLIFTKVS
PDPATFTLKY APPPSAFNVG VGMAFVFLAF AGYETTSVLG EEAVDPKNTI TKGVFTSALL
VGITYLMASE AFTVGWGVND MSSFFSQLVP GIVLGMRYGG FVLAVILTIL LINSGLTDSV
TFFNTVSRVV YAMARDGVLD KRLEGIHDNN RTPHVAILFS LAFSLLYTLI FSAAIGPANV
FLSVGITTTF GFLIAIFTAN ISLLFILRRF SALNVWNVLL TVIINAILGF VIFANIVTTA
VNSFVLIGVA TFAGWMIIGA IYYWLRKVRV