Gene Information |
Locus tag | Msed_1138 |
Symbol | |
ID | 5103486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1074669 |
End bp | 1075916 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507030 |
Product | AAA family ATPase |
Protein accession | YP_001191223 |
Protein GI | 146303907 |
COG category | [R] General function prediction only |
COG ID | [COG1373] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
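The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon. Both assumptions agree with the numbers in this record, but the record itself does not state them. The function names are hypothetical and introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1074669, 1075916)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```

Both values match the Gene Length and Protein Length fields reported for Msed_1138.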
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.631516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGAGTTG AGGACTTCAA GGCGGTGATT GCTGAATTCC TTAACTCGGA GATACCCAAG ACCACGGATA GGGAGACCAG GTTGCCCCTT GACACAAACT ACGTGATCAC GTTGACCGGT GGAAGAAGGG TTGGGAAGAC CTACATCCTC TACAACACCA TGTCCAGGTT AGTTTCCGAG GGCAAGGCCT CCAAGGACGA GATAGTTTAT GTGGATTTCG AACACCCCAG GCTGAGGAAC TTAAGTGCCG TTGATCTTGA CGACATATTG ACTGCCTTCT ACGAGTTAAC AGGGAAGAAG CCAAGGTATC TCTTCCTCGA TGAGATACAG ACCGTGAAGG ATTACGGGAG TTGGTTCAGG AGGAGGCTTG ACGCGAGGGT TTTCCTGACG GGGTCCTCCT CCGCATTAAC TCCCTCAAGG ATAGCCGAGG AGCTCAGGGG AAGGAGCCTG AACTTTGAGG TCTTCCCCCT CTCCTTCAGG GAGTACCTGT CCTTTCTGGG AGTACGTGTG AACCCTGAGA TCACCCTATA CACAGAGGAA AAGGGGAAGA TCCTATCCCT ATTGAGGGAG TATCTCAGGT ATGGCGGATA TCCAGCGGTG GTCCTTGAGA GGGACCCAGG TCTCAAGAAG ATGCTGTTAC GATCCTACTT CGACTCCGTC GTGGTGAGGG ACCTGAACGA GAGGTATGCC GAAACCTTTG CCTCCTACAT CGTGTCAAAT TACTCGTCGC TGATCTCGTA CAATAGGGTT TACAACTACC TGAAAACCCT GGGTTTCAAG GTAAGTAAGG AGAAGGTGAT CGAACTCTTT CGCAGGGGGA GGGAGGCGTA CTTCCTGTTC GAGGTGGAGG TGTTTGAAAG GAGCGAGACT AAGAGGAAGG TGAATCCTAG AAAGGTCTAC ATCGTGGACA TGGGTTATCC CTACGCCTTG GGGTATGACT CAGTGTCTAA GGCTATGGAA AACGCGGTCT ACCTCCAGTT GAGGAGGGAG GGGAAGGAGG TGTATTACTG GAGATCTGAG GACGCGGAGG TGGATTTTGT GGTGAGTGAG AAAATGGAAC CTAAGGAGCT CATACAGGTG ACCTACGCCG AGGACAAGAT AGAGGACAGG GAGGTGAAGG GATTGAGAAA GGCTGAAAGG GAGATCAACG CGGAAAGGTC CACGATCATA ACCTGGAGCT ACCAAGGGAG GGTCAACGGT TATCAGGCAG TTCCTCTTTG GTATTGGTTA TTAAGGAGAG AGAGATAG
Protein sequence | MRVEDFKAVI AEFLNSEIPK TTDRETRLPL DTNYVITLTG GRRVGKTYIL YNTMSRLVSE GKASKDEIVY VDFEHPRLRN LSAVDLDDIL TAFYELTGKK PRYLFLDEIQ TVKDYGSWFR RRLDARVFLT GSSSALTPSR IAEELRGRSL NFEVFPLSFR EYLSFLGVRV NPEITLYTEE KGKILSLLRE YLRYGGYPAV VLERDPGLKK MLLRSYFDSV VVRDLNERYA ETFASYIVSN YSSLISYNRV YNYLKTLGFK VSKEKVIELF RRGREAYFLF EVEVFERSET KRKVNPRKVY IVDMGYPYAL GYDSVSKAME NAVYLQLRRE GKEVYYWRSE DAEVDFVVSE KMEPKELIQV TYAEDKIEDR EVKGLRKAER EINAERSTII TWSYQGRVNG YQAVPLWYWL LRRER
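The sequences in this record are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
example = "ATGAGAGTTG AGGACTTCAA"
seq = clean_sequence(example)
print(len(seq))                    # 20
print(round(gc_fraction(seq), 2))  # 0.4
```

Running the same functions on the complete gene sequence would give a value to compare against the GC content field. That comparison is not shown here.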