Gene Msed_1393 details


Gene Information

Locus tag: Msed_1393
Symbol:
ID: 5104603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1365939
End bp: 1367024
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 47%
IMG OID: 640507282
Product: peptidase M28
Protein accession: YP_001191475
Protein GI: 146304159
COG category: [R] General function prediction only
COG ID: [COG4882] Predicted aminopeptidase, Iap family
TIGRFAM ID:
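
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only and are not part of the source database.

```python
# Values copied from the gene record.
start_bp = 1365939
end_bp = 1367024
reported_gene_length = 1086      # bp
reported_protein_length = 361    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the script prints `1086 0 361`, which agrees with the reported gene and protein lengths.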


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0177008
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000310034
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGGAATTGG CCAGACAACT ATTCAACTTA GGTGAAGCGA TCCATGGCGG GCCTGAGGAA 
TTGAAGATTC TCGAAAAACT TGAGGAGTTA TTTCCAGACT ACGAATCAAT CCCCGTCAAT
ACAAAGTTCT GGGATGTCAG ATTCTCTGAG ATCCTAGCTA ATGGTCAAAA CATACCCTCG
GTAGCAATGC CCTACACCTC AGGCTGTGTT AAGGGTAGAG TGGGAAGAGA GATAGGTTTG
TTTCCAATGC CTTCCCATCC TTTTGATCTC AAGAACTTAC CGCTTTCCCA GTATGAGGGA
GTGATAATTG TGGAGGAGGG AAAGCTGAGA AGAATAACTC TTCCGCAAGG CTCACCTCCT
ACGTTCTTTG CGTCTAGGAA TGTAGACGGT TATGTCGAGC TTTGCTCTGA TACAAGGCTT
GTTGAGGCGA ACTCCAGAAA CCTGGAGATC ACCTTAAGGG AGGGAGACTC CTACATTCTG
CTTGGCGCTC ACGTGGATCA CTGGTTATCT GGTTTTCACG ACAATATCTT ATCCGTTCAA
CTCTTGGTGG ACATGAAGAA AGACCTGGAA AGATCTAACC TAAGGCATGG GGTCAAGCTG
GTTTTCTTCT CCTCAGAGGA AGGGCCAAGA TGCTGTACAG GTTCATCACA GTTTCCTGTA
AAGGACGCAT TTGCGGTGAT ATCCCTAGAC GCTATCTATC CATCTAGGGT TGTATTCTCA
GGAACCCCTG ACCTATGGTT CCTGTCTAAA CATTTTCCCT TGAAGCGAGT GGAAATGCCT
ACACCGTTCT CTGATCACTA TCCCTTCGTA CAGAGGGGGA TCCCCGGTCT CGTGCTCTAT
AATGATGACA TGACCACAGT CTACCACTCG GATGCAGACG TTCCAACTCC CCTTGACCCC
CAGTACCTTG AAGTTTTGAG GAAAAGTCTT GTTGAGGCTT TGCGGGAATT GGATTCTACC
CCTAGCGACA GGCTCGATGA AGAATTCTTT AGACATGCTA AACTCGCCGG TTACACTGGG
GATACTAGGG AGGGGGCACT GATTCCAGAT CCATCAACCT TGACTACCAA GTTTAAAAGA
ATCTAG
 
Protein sequence
MELARQLFNL GEAIHGGPEE LKILEKLEEL FPDYESIPVN TKFWDVRFSE ILANGQNIPS 
VAMPYTSGCV KGRVGREIGL FPMPSHPFDL KNLPLSQYEG VIIVEEGKLR RITLPQGSPP
TFFASRNVDG YVELCSDTRL VEANSRNLEI TLREGDSYIL LGAHVDHWLS GFHDNILSVQ
LLVDMKKDLE RSNLRHGVKL VFFSSEEGPR CCTGSSQFPV KDAFAVISLD AIYPSRVVFS
GTPDLWFLSK HFPLKRVEMP TPFSDHYPFV QRGIPGLVLY NDDMTTVYHS DADVPTPLDP
QYLEVLRKSL VEALRELDST PSDRLDEEFF RHAKLAGYTG DTREGALIPD PSTLTTKFKR
I
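
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a sketch rather than the method used by the source database: it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which codons may act as starts), and it does not model alternative start codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGGAATTGG CCAGACAACT ATTCAACTTA GGTGAAGCGA TCCATGGCGG GCCTGAGGAA"
print(translate(first_line))  # MELARQLFNLGEAIHGGPEE
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content listed in the record.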