Gene Information |
Locus tag | Msed_0465 |
Symbol | |
ID | 5105461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 420319 |
End bp | 421287 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506371 |
Product | formamidase |
Protein accession | YP_001190566 |
Protein GI | 146303250 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
|
|
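The length fields in this record are consistent with its coordinates: the gene spans End - Start + 1 bases, and the protein length equals the codon count minus one stop codon. A minimal Python sketch of that check follows. It assumes 1-based, inclusive coordinates and a single untranslated terminal stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp / End bp rows of this record.
length = gene_length(420319, 421287)
print(length, protein_length(length))  # 969 322
```

Both values match the Gene Length (969 bp) and Protein Length (322 aa) rows.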
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000577809 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACACAA TTCATTCAAG TAAAATTCAC AATAAGTGGG ACAACTCCTT GTCTCCTGTC CTAACCATCA AGTCTGGAGA CGTGATAACG GTCGAGTCCA GAGAGGCATC AGATGGTCAG GTGACTCCTT CATCTTCCCC TTCCGACCTA CTGAAGCTAG ATTTCTCAAG GATTCACCCG CTTACAGGAC CTGTTGAGAT TGAGGGAGCA GAACCGGGGG ACGCACTTGA GATAGAGTTC CTGGATTTTG CGACGAAGGG ATGGGGATGG ACAGGGGTTC TGCCAGGCTT CGGATTCCTG GCCAATGAAC CCTACACCAC CCCGATTGAC CTAGCAGGTC CAGCCCTGAA AATATGGAAG GTGGAAAGGG AGGCAATTGC TAAGTTCGGT GACATAGAGG TAAGGGTTCC ATCAAGACCC TTCCCCGGGG TTATAGGTAC TGCCCTTCCT ACACCTGGTA AATTCAGCAC AATACCTCCA AGGGAGAACG GTGGAAACAT GGACATTAAA CACCTGACCA AGGGTACGAA GCTTTATCTC CCCGTGTTTG TGAGTGGGGG TCTCCTATCC CTTGGGGACA CACACGTGGC ACAAGGAGAC GGAGAGGTAT GCGGGACTGC AATAGAGGCT CCTATGGACG TGACAATTAA GGTCACCTTG CACAAGAACG CTGGGATTAC TCAACCCCTC TTTGAGACCC CTGCAGTCAA GGAAGGCGAC TTCAAGGAAT ACCTGGCATA CCCTGGAATA GATCCCAACT TATGGGAAGC GGCTAAGAAG GCAATTAAGG GGATCATTGG CATTCTCTCC TCTCACATGA CCCCTGTAGA GGCTTACATG CTCGCTAGCG CCGTTGTAGA CCTTAAGGTA AGCCAAGTGG TCGACGTTCC GAACTGGATA GTGACGGCAT ACCTGCCTAA GGACATATTC CCTGAGGAGA TAAGGCCTAA ATTGAGGTTA ATCAGATAG
|
Protein sequence | MYTIHSSKIH NKWDNSLSPV LTIKSGDVIT VESREASDGQ VTPSSSPSDL LKLDFSRIHP LTGPVEIEGA EPGDALEIEF LDFATKGWGW TGVLPGFGFL ANEPYTTPID LAGPALKIWK VEREAIAKFG DIEVRVPSRP FPGVIGTALP TPGKFSTIPP RENGGNMDIK HLTKGTKLYL PVFVSGGLLS LGDTHVAQGD GEVCGTAIEA PMDVTIKVTL HKNAGITQPL FETPAVKEGD FKEYLAYPGI DPNLWEAAKK AIKGIIGILS SHMTPVEAYM LASAVVDLKV SQVVDVPNWI VTAYLPKDIF PEEIRPKLRL IR
|
| |
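The sequences above are displayed in space-separated blocks of ten letters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC percentage like the one in the GC content row. It assumes the only non-sequence characters are whitespace; the helper names are hypothetical, and the demo uses only the first three blocks of the gene sequence, so its GC value is not expected to equal the 51% reported for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, used only as a short demo.
demo = clean_sequence("ATGTACACAA TTCATTCAAG TAAAATTCAC")
print(len(demo), round(gc_percent(demo), 1))  # 30 26.7
```

Passing the full Gene sequence field through `clean_sequence` should give a string whose length can be compared against the Gene Length row.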