Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1393 |
Symbol | |
ID | 5104603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1365939 |
End bp | 1367024 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507282 |
Product | peptidase M28 |
Protein accession | YP_001191475 |
Protein GI | 146304159 |
COG category | [R] General function prediction only |
COG ID | [COG4882] Predicted aminopeptidase, Iap family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0177008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000310034 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGAATTGG CCAGACAACT ATTCAACTTA GGTGAAGCGA TCCATGGCGG GCCTGAGGAA TTGAAGATTC TCGAAAAACT TGAGGAGTTA TTTCCAGACT ACGAATCAAT CCCCGTCAAT ACAAAGTTCT GGGATGTCAG ATTCTCTGAG ATCCTAGCTA ATGGTCAAAA CATACCCTCG GTAGCAATGC CCTACACCTC AGGCTGTGTT AAGGGTAGAG TGGGAAGAGA GATAGGTTTG TTTCCAATGC CTTCCCATCC TTTTGATCTC AAGAACTTAC CGCTTTCCCA GTATGAGGGA GTGATAATTG TGGAGGAGGG AAAGCTGAGA AGAATAACTC TTCCGCAAGG CTCACCTCCT ACGTTCTTTG CGTCTAGGAA TGTAGACGGT TATGTCGAGC TTTGCTCTGA TACAAGGCTT GTTGAGGCGA ACTCCAGAAA CCTGGAGATC ACCTTAAGGG AGGGAGACTC CTACATTCTG CTTGGCGCTC ACGTGGATCA CTGGTTATCT GGTTTTCACG ACAATATCTT ATCCGTTCAA CTCTTGGTGG ACATGAAGAA AGACCTGGAA AGATCTAACC TAAGGCATGG GGTCAAGCTG GTTTTCTTCT CCTCAGAGGA AGGGCCAAGA TGCTGTACAG GTTCATCACA GTTTCCTGTA AAGGACGCAT TTGCGGTGAT ATCCCTAGAC GCTATCTATC CATCTAGGGT TGTATTCTCA GGAACCCCTG ACCTATGGTT CCTGTCTAAA CATTTTCCCT TGAAGCGAGT GGAAATGCCT ACACCGTTCT CTGATCACTA TCCCTTCGTA CAGAGGGGGA TCCCCGGTCT CGTGCTCTAT AATGATGACA TGACCACAGT CTACCACTCG GATGCAGACG TTCCAACTCC CCTTGACCCC CAGTACCTTG AAGTTTTGAG GAAAAGTCTT GTTGAGGCTT TGCGGGAATT GGATTCTACC CCTAGCGACA GGCTCGATGA AGAATTCTTT AGACATGCTA AACTCGCCGG TTACACTGGG GATACTAGGG AGGGGGCACT GATTCCAGAT CCATCAACCT TGACTACCAA GTTTAAAAGA ATCTAG
|
Protein sequence | MELARQLFNL GEAIHGGPEE LKILEKLEEL FPDYESIPVN TKFWDVRFSE ILANGQNIPS VAMPYTSGCV KGRVGREIGL FPMPSHPFDL KNLPLSQYEG VIIVEEGKLR RITLPQGSPP TFFASRNVDG YVELCSDTRL VEANSRNLEI TLREGDSYIL LGAHVDHWLS GFHDNILSVQ LLVDMKKDLE RSNLRHGVKL VFFSSEEGPR CCTGSSQFPV KDAFAVISLD AIYPSRVVFS GTPDLWFLSK HFPLKRVEMP TPFSDHYPFV QRGIPGLVLY NDDMTTVYHS DADVPTPLDP QYLEVLRKSL VEALRELDST PSDRLDEEFF RHAKLAGYTG DTREGALIPD PSTLTTKFKR I
|
| |