Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1940 |
Symbol | |
ID | 7270744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 2055888 |
End bp | 2056958 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643570554 |
Product | UBA/THIF-type NAD/FAD binding protein |
Protein accession | YP_002466967 |
Protein GI | 219852535 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.124053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.423901 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACACTCA CAGAAAAAGA TGAAACCGGA TCCGTTCTGG ATTCCCAGAA TTCGACGAAG AACCGTTACT GCCGCCAGAT CATCCTTCCT GAGATTGGGA CAGAGGGGCA ACGCCGATTC AAAGAGAGCC GGGCAGTCAT TGTGGGTCTT GGAGCAACAG GAAGTTCCGT GGCCAACTCA CTGGTGCGGG CGGGTATCGG ACAGGTAGTT TTGATCGATC GTGATCTGGT GGAGCTGCAC AACCTGCAGC GGCAGATCCT GTACAGCGAG GAAGATCTGA ACCGCCCAAA GGCTGTTGCC GCTGCTGAGA TCCTGCGAAA GATCAATTCT TCCATTGAGA TTGAGGCACA TGTAACAGAT TTCAACATAT CAAATGCCGA AAAACTCCTA TCAGGAGCGA ACGTGGTTCT GGATGGAACC GACAACCTCC AGACCCGTTT CCTCATCAAC GATATCTGCG TAAAGCATAG CATCCCCTGG ATCTATGCCG GCGTTGTCGG GACCGGGGGC ATGGTGATGC CCATCCTGCC GGGCAAAACT CCCTGTTTCC GGTGTCTCGT TCCTTCGCTT CCCGGACCCG GCCTGTTGCA GACGTGCGAC ATTGCAGGTG TCTTAAATAC GATGCCTCCT CTCATCGCCT CTATTGAGTG TACCCTCGCT TACCAGATCC TGACGGGGCA GTTCGACCCG AAGGATGAGA TCTCGTACAT GGTATATATT GATGGATGGC GGAACACGTT CGATCGAGTG GCGGTTGGGA GACAACCAGA CTGTCCGTGT TGCGTACAGG GTCAGAGGGA TTTCCTGGAT GCGGTCTCCC GGGAGATGGT AACATCACTC TGTGGGAGAG ACGCGATCCA GATCATCCCC TCATCCCAGA TGGAGATCAC TCTTGAAGAC CTTGAGATCC GTCTCTCCCG TCTTGGCGAG GTACGTCTCC ACCCATACAT GCTCACATTC CGGACCGGAA CCGAAGAGAT CTCAATATTC CGTGACGGAA GGGCGATCAT CAAAGGAACC AAAGACGAGG CTATGGCCCG TTCCGTTTAT GCACGGTACA TAGGGCTCTA G
|
Protein sequence | MTLTEKDETG SVLDSQNSTK NRYCRQIILP EIGTEGQRRF KESRAVIVGL GATGSSVANS LVRAGIGQVV LIDRDLVELH NLQRQILYSE EDLNRPKAVA AAEILRKINS SIEIEAHVTD FNISNAEKLL SGANVVLDGT DNLQTRFLIN DICVKHSIPW IYAGVVGTGG MVMPILPGKT PCFRCLVPSL PGPGLLQTCD IAGVLNTMPP LIASIECTLA YQILTGQFDP KDEISYMVYI DGWRNTFDRV AVGRQPDCPC CVQGQRDFLD AVSREMVTSL CGRDAIQIIP SSQMEITLED LEIRLSRLGE VRLHPYMLTF RTGTEEISIF RDGRAIIKGT KDEAMARSVY ARYIGL
|
| |