Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0093 |
Symbol | |
ID | 7272263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 102671 |
End bp | 104206 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643568751 |
Product | Nitrogenase |
Protein accession | YP_002465210 |
Protein GI | 219850778 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.061617 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAACCA TTGAAGAACC GTTGATCCAA AAGATTGATC TCTCCCAGCC CACCTCTCCG AACCGCGAGC AACGGGCCTC CGGGATCAAT GTCTACTACG GAAAGGCAAG CCAGCTTACG GAGGATGCAC GGAACGGATG CCTGAGAGGT CATAATCGTA AATTTCAGCA GACCTCGGGA TGTGCCCTGA ACTTCTACCT CTCTGTTCGG ATCGGTACGA TTCGGGATGC AGTCGCGATC TACCATGCGC CGGTCGGTTG CTCTGCCGCG GCCCTCGGGT ACCGGGAGAT CTACCGGGGG GTCCCGGTGG AGCTTGGCAG ACCTGCAAAC TTTGAACTTC ACTGGATCAC GACGAATCTC CGTGAGAACG ATGTGGTCTA TGGTGCAGGA GAGAAACTGA AGGAAGCTAT CCGGGAAGCC GAGCGGAGGT ACTCCCCCAA GGCGATCTTC ATCATGACCT CCTGCACCTC AGGGATCATC GGTGAGGATA TCGAAGGAGT CGTCGCAGAG ATGCAGCCGG TGACCAAGGC CGTGCTGGTC CCGATCCACT GTGAGGGGGT CAGGTCCCGT CTGGTCCAGA CCGGGTACGA CGCGTTCTGG CATGGCGTGC TGAAGTATCT GGTCAGAAAA CCGGAGAAGA AGCAGAAGGA TCTCGTCAAT GTGGCGAGTA TGCTCTCCTA TACCTGGCAG GACCGGCTTG AGATCAAGCG GCTGCTCACC AAACTCGGTC TTCGGGTGAA CTTCATCCCG GAATTCGCCA CCGTCGAGGA ACTGCAGCAG CTCTCCGAGG CCGCGGTCAC CGCCCCGATC TGCCCGACCT ACACCGATTA TCTCTCCCGT GGTCTTGAAC AGGAGTACGG AGTTCCGTTC TTCCTCTATC CCTCGCCGAT GGGGATTGCA AATACAGACG CCTGGCTGCG GGAGATCGCA AAGTACACCG GCAAAGAGGT CGAAGTCGAG CAGTTGATCG AGGAGGAGCA TAAGCGCTGG GTTCCGCAGA TCGAGCGGAT CAAGGACGAA CTGGCCCATA TCAAGAAGGA CGGTTCGAAG ATCTCTGTAC TCGGCTCACT CGGACAGGGG AGGTTACTTG CTCAACTTCC CTACTTCGAC GAACTCGGGC TTTCATCTCC TGCGGCGATG TGCCAGGACT TCGATAACCT GATCCTCGAG GAGCTCGAAG GGCTGATCAA AAAGTACGGG GACTTCAACA TCCTGGTCAA CACCTTCCAG GCTGCAGAAC AGTCCCATAT CACCAGGGAG CTCGACCCGG ATATCACCCT GACCTGCCCG TTCCAGGGCG GGGCCTTCAA ACGGAAGAAG GGGGTGACCA GGATCCATGC GCTCCGTGGT GACGCAAGCC CCTGGTCCAC CCAGTCGGGC TATGCCGGGG CCGTGGCCTT CGGCAACTTC CTGCTTCAGG CACTCAAGAG TGGTGCATTC CAGGAGTTGA TGCTCAAAAA GACCGAGGAC AGTTACAAGC AGTGGTGGTA TCAGCAGCCC GATCCGCTCC ACTACCTCGC CAGGGAGGAA GAATGA
|
Protein sequence | MVTIEEPLIQ KIDLSQPTSP NREQRASGIN VYYGKASQLT EDARNGCLRG HNRKFQQTSG CALNFYLSVR IGTIRDAVAI YHAPVGCSAA ALGYREIYRG VPVELGRPAN FELHWITTNL RENDVVYGAG EKLKEAIREA ERRYSPKAIF IMTSCTSGII GEDIEGVVAE MQPVTKAVLV PIHCEGVRSR LVQTGYDAFW HGVLKYLVRK PEKKQKDLVN VASMLSYTWQ DRLEIKRLLT KLGLRVNFIP EFATVEELQQ LSEAAVTAPI CPTYTDYLSR GLEQEYGVPF FLYPSPMGIA NTDAWLREIA KYTGKEVEVE QLIEEEHKRW VPQIERIKDE LAHIKKDGSK ISVLGSLGQG RLLAQLPYFD ELGLSSPAAM CQDFDNLILE ELEGLIKKYG DFNILVNTFQ AAEQSHITRE LDPDITLTCP FQGGAFKRKK GVTRIHALRG DASPWSTQSG YAGAVAFGNF LLQALKSGAF QELMLKKTED SYKQWWYQQP DPLHYLAREE E
|
| |