Gene Msed_1610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1610 
Symbol 
ID5103974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1556523 
End bp1557587 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content45% 
IMG OID640507500 
Productpeptidase M24 
Protein accessionYP_001191689 
Protein GI146304373 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.75509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGGCTCA ACAAGCTTGA GAAAATAAAG GAAAAGTCTA ACGCGAAAAA CCTGCTAATT 
GTCGGGGAGC CTAACCTGTT CTATTTCACG GGATATAGGG GAGTTGGTGG CTTACTGGAC
TGTGATGGTA CAAGGACCCT ACTGGTGCCA TTACTGGAAA GGAATAGGGC CTTGGGAATC
AAGGACTTGG ATGTGAAGGT ATATTACCCG GTAAAGCTCG AGGAGAACGT AATAGAGGGA
ACGCTAGTTT CAGCAGTAGA AAAACTCTGT CCTTCCACGA CAGATAAAAA GCTCTTGATA
GATTTGGGTT ACGCCTCTGT GGATCTATTC CTTCAGTTGA GTTCTAAGTA TGAAGCGAAG
AACATCACAG AAGATATACT CCAGACGAGG GCAATAAAGG AGGAAAAGGA AATTGAGGCT
ATCAGGCATG CTCAGAGGGC AACCGCCATG GCCATGAAGA TGGCAAGCGA GTCTCTAGTA
GAGGGAATAT CCGAAATTGA ACTTGCAGGC ATAATTGACG AGACCATGAG AAAGGGTGGT
GCTGAGGACT ATGCTTTTCC CTCTATAGTC GCCTTTGGTG AAAATTCGGC TGAACCTCAC
CATATTCCAT GCGAAAGAAG GCTGAGAAAG GGTGATACAG TAGTGGTAGA TATAGGGGCT
AAATACAATG GATATTCCTT TGACAGCACA AGGACATTCC TGTACGGAAT CACAGAGAAA
AGCAAGAGGA TATATGACGT GGTTCTTGAG GCACAACTAG AGGCAATCGA CGCAGTCCAG
GAAGGAATAG AGGCGTCTCA AATCGATAGG ATAGCCAGAT CCAGGATTGA GAAGGAGGGT
TTCGGAAAAC TATTCGTTCA CTCCACGGGA CATGGGGTGG GAATCGAGGT CCATGAAAGC
CCAGCAATTT CCATGAAGTC TAAAGACATC CTAAGGGAAG GTATGGTAAT AACGGTAGAA
CCAGGTATAT ACTTCCAAGG TGAACTGGGC GTTAGAATAG AAGATACAAT CCTTGTCAGA
AAGGGTAAAC CGGAGGTCCT TGAGACCCTT TATAAGACCT TGTAA
 
Protein sequence
MRLNKLEKIK EKSNAKNLLI VGEPNLFYFT GYRGVGGLLD CDGTRTLLVP LLERNRALGI 
KDLDVKVYYP VKLEENVIEG TLVSAVEKLC PSTTDKKLLI DLGYASVDLF LQLSSKYEAK
NITEDILQTR AIKEEKEIEA IRHAQRATAM AMKMASESLV EGISEIELAG IIDETMRKGG
AEDYAFPSIV AFGENSAEPH HIPCERRLRK GDTVVVDIGA KYNGYSFDST RTFLYGITEK
SKRIYDVVLE AQLEAIDAVQ EGIEASQIDR IARSRIEKEG FGKLFVHSTG HGVGIEVHES
PAISMKSKDI LREGMVITVE PGIYFQGELG VRIEDTILVR KGKPEVLETL YKTL