Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3621 |
Symbol | |
ID | 7092894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 3986708 |
End bp | 3987742 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643466909 |
Product | Fe-S cluster assembly protein NifU |
Protein accession | YP_002363868 |
Protein GI | 217979721 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0694] Thioredoxin-like proteins and domains [COG0822] NifU homolog involved in Fe-S cluster formation |
TIGRFAM ID | [TIGR02000] Fe-S cluster assembly protein NifU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.237955 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAA GCGCCGGGAC CCCCGCCATG TTGGACTACA GCGACAAAAT CAAGGATCAT TTCTTCAATC CGAAGAATGC CGGTCCCCTC GCCGACGCCA ATGCGGTCGG AGAAGCTGGG GCGCTCTCAT GCGGCGACGC CTTGAAGCTG ATGCTGAAAG TCGACCCGCT GACAGAGGTC ATTCTCGACG CCCGCTTTCA GACATTCGGC TGCGGCTCGG CGATCGCTTC CTCTTCGGCG CTGACCGAGC TCATCATCGG CAAGACAATC GTGCAAGCCG CCGGACTGAC CAACCAGCAG ATCGCAGAGT CCCTTGGCGG CCTGCCGCCG GAAAAAATGC ATTGCGCGGT CCTCGGCCAT GACGCGCTGC AGGCGGCGAT CGCGAATTTT CGCGGCGAAG CCTGGACGGC CGGCGACGCG CCCGGCGCCC TGATCTGCAA ATGCTCCGGC GTCGGCGAAG GCGCGCTGAA GCGCGCCATC CGGATGAACA GGCTGACAAG CGTCGACCAA CTCACCGCCT TTACCAAGGC CGGCGCCGGC TGCTTCACCT GCTACGACAA GCTTGAGGCC GTCCTCGCGC AAACCAACGC CGAGTTGGTC GCCGAAGGAT TGATCGCCAA GGAGGCCGCC TTTTCTCTGC AAACCCGCGC GCCAAGGCCG GCGCCGGCCG CAGTCGCAGC GGCGCCCGAG ACCCCAGCGC AGCCGGTCTC GCGCCCCAGA GTGCAGATTT CGCCGCTCGC CCCTTCGGCG CCCGCGGGCG AACCGGCGAT GACGAATTTG CGCAAGATCA AGCTGATCGA GGAGACGATC GAGGAGCTGC GCCCGTTCCT GCGCAAGGAT GGCGGCGATT GCGAATTGAT CGACGTCGAC GGCTCGAATG TCCTCGTCAA AATGTCGGGC GCCTGCGTGC TCTGCAAACT CGCCAGCGCG ACGATCTCCG GCATTCAGGA AAAACTGGTC GAAAAGCTAG GCGTCCCGCT ACGCGTCATC CCGGTCGGCA AGACCCCTTT CGGCAAAGTC GCCGGCGGAC ATTGA
|
Protein sequence | MTESAGTPAM LDYSDKIKDH FFNPKNAGPL ADANAVGEAG ALSCGDALKL MLKVDPLTEV ILDARFQTFG CGSAIASSSA LTELIIGKTI VQAAGLTNQQ IAESLGGLPP EKMHCAVLGH DALQAAIANF RGEAWTAGDA PGALICKCSG VGEGALKRAI RMNRLTSVDQ LTAFTKAGAG CFTCYDKLEA VLAQTNAELV AEGLIAKEAA FSLQTRAPRP APAAVAAAPE TPAQPVSRPR VQISPLAPSA PAGEPAMTNL RKIKLIEETI EELRPFLRKD GGDCELIDVD GSNVLVKMSG ACVLCKLASA TISGIQEKLV EKLGVPLRVI PVGKTPFGKV AGGH
|
| |