Gene Msil_3621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3621 
Symbol 
ID7092894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3986708 
End bp3987742 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content64% 
IMG OID643466909 
ProductFe-S cluster assembly protein NifU 
Protein accessionYP_002363868 
Protein GI217979721 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0694] Thioredoxin-like proteins and domains
[COG0822] NifU homolog involved in Fe-S cluster formation 
TIGRFAM ID[TIGR02000] Fe-S cluster assembly protein NifU 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.237955 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAA GCGCCGGGAC CCCCGCCATG TTGGACTACA GCGACAAAAT CAAGGATCAT 
TTCTTCAATC CGAAGAATGC CGGTCCCCTC GCCGACGCCA ATGCGGTCGG AGAAGCTGGG
GCGCTCTCAT GCGGCGACGC CTTGAAGCTG ATGCTGAAAG TCGACCCGCT GACAGAGGTC
ATTCTCGACG CCCGCTTTCA GACATTCGGC TGCGGCTCGG CGATCGCTTC CTCTTCGGCG
CTGACCGAGC TCATCATCGG CAAGACAATC GTGCAAGCCG CCGGACTGAC CAACCAGCAG
ATCGCAGAGT CCCTTGGCGG CCTGCCGCCG GAAAAAATGC ATTGCGCGGT CCTCGGCCAT
GACGCGCTGC AGGCGGCGAT CGCGAATTTT CGCGGCGAAG CCTGGACGGC CGGCGACGCG
CCCGGCGCCC TGATCTGCAA ATGCTCCGGC GTCGGCGAAG GCGCGCTGAA GCGCGCCATC
CGGATGAACA GGCTGACAAG CGTCGACCAA CTCACCGCCT TTACCAAGGC CGGCGCCGGC
TGCTTCACCT GCTACGACAA GCTTGAGGCC GTCCTCGCGC AAACCAACGC CGAGTTGGTC
GCCGAAGGAT TGATCGCCAA GGAGGCCGCC TTTTCTCTGC AAACCCGCGC GCCAAGGCCG
GCGCCGGCCG CAGTCGCAGC GGCGCCCGAG ACCCCAGCGC AGCCGGTCTC GCGCCCCAGA
GTGCAGATTT CGCCGCTCGC CCCTTCGGCG CCCGCGGGCG AACCGGCGAT GACGAATTTG
CGCAAGATCA AGCTGATCGA GGAGACGATC GAGGAGCTGC GCCCGTTCCT GCGCAAGGAT
GGCGGCGATT GCGAATTGAT CGACGTCGAC GGCTCGAATG TCCTCGTCAA AATGTCGGGC
GCCTGCGTGC TCTGCAAACT CGCCAGCGCG ACGATCTCCG GCATTCAGGA AAAACTGGTC
GAAAAGCTAG GCGTCCCGCT ACGCGTCATC CCGGTCGGCA AGACCCCTTT CGGCAAAGTC
GCCGGCGGAC ATTGA
 
Protein sequence
MTESAGTPAM LDYSDKIKDH FFNPKNAGPL ADANAVGEAG ALSCGDALKL MLKVDPLTEV 
ILDARFQTFG CGSAIASSSA LTELIIGKTI VQAAGLTNQQ IAESLGGLPP EKMHCAVLGH
DALQAAIANF RGEAWTAGDA PGALICKCSG VGEGALKRAI RMNRLTSVDQ LTAFTKAGAG
CFTCYDKLEA VLAQTNAELV AEGLIAKEAA FSLQTRAPRP APAAVAAAPE TPAQPVSRPR
VQISPLAPSA PAGEPAMTNL RKIKLIEETI EELRPFLRKD GGDCELIDVD GSNVLVKMSG
ACVLCKLASA TISGIQEKLV EKLGVPLRVI PVGKTPFGKV AGGH