Gene Mnod_4004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_4004 
Symbol 
ID7307456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp4079172 
End bp4080971 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content69% 
IMG OID643601662 
Producttranscriptional regulator, NifA, Fis Family 
Protein accessionYP_002499192 
Protein GI220923890 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID[TIGR01817] Nif-specific regulatory protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTACG CGGACCTTTT TCCCTCCCGA ATCGAGACGG GCCTGCCCCA GGTGGAGCAC 
GTCGTCCCGG AGCGGCTTGG GCAGCAGCCC GCCCCGCCGA GCCCGACCCC GAGGCTGCCC
GTCCAAGCGC GTCCCGCCTC AGCCTCGCGC GGCACCTCGG CCGAGGTGCA GCTCATCGGC
ATCTACGAGA TCTCCAAGGC GCTGACCGCG GTGCGCCACC TCGAGGTCAC CCTCACTAAG
GTGATCGACA TCCTCTCGTC GACTCTGAGC ATGCACCACG GCATGATCGT CATCCTCGAC
CAGGAGGGCG GGCCGGAAAT CGTCTCAAGC ACCGGCTGGA CGGCCCAGAT CGCCCACCAC
CTCCGCGCGT GCCTGCCCCA GCGGGCCATC GACCAAATCG TCGACACCGC GGCCCCGCTC
GTCGTGCAGG ACGTCAGCGC CGACCCGCTC TTCCACGGCC ATCTCGATCT GTTCGAGGAT
GCCGGCAAGG CCATCACGTC GTTCATCGGC GTGCCGATTA AGGCGGATTC ACGGGTCCTC
GGCACCCTCT CAATTGACCG GATCTGTGAC GGCAGCGCGC GCTTCTGCTC CGACGAGGAC
ACGCGCTTTC TCACCATGGT GGCGAACCTC ATCGGCCAGA CGGTGTGGCT CCACAACACG
CTGGCGCAGG ATCGCGACCG GCTCATCGCC AAAGCCCACC GGCTGGAGAA GGCCCTGGCG
GAAACGAGCG CGATCTCCTC CTCAGTCGGC CTTGTCGGCG AGAGCCAGGA GCTGAAGCGG
CTCGCCGCGA AGGCCGAGGT CGCGGCGCGC TCGAACACGA CTGTCCTGTT GCGCGGCGAG
AGCGGCACTG GCAAGGAGCT CTTCGCCCGC GCGATCCACG AACTCTCGCC CCGCAAGAGC
AAGCCCTTCG TGCGGGTCAA CTGCGCGGCG CTCGCCGACA GCGTGCTCGA ATCCGAGTTG
TTCGGGCACG AGAAGGGGGC CTTCACGGGC GCGATCGCGA CGCGCCATGG CCGCTTCGAG
GCCGCCAACG GCGGGACGCT GTTCCTCGAC GAGATCGGCG AGGTGAGCGC CACCTTCCAG
GCCAAGCTCC TGCGCGTCCT GCAGGAGGGG GAGTTCGAGC GCGTCGGCGG CAACCGCACC
ATCAAGGTCG ACGTGCGCCT CGTCTTCGCG ACGAACCGGA ACCTGGAGGA GGCGGTCACC
AAGGGCGACT TCCGCGCCGA CCTCTACTAC CGCATCAACG TGGTGTCGCT GATCCTGCCG
CCGCTGCGCG AGCGGCGGGG CGACATCCCG GATCTCGCGA AGGCCTTCCT CACCCGCTAC
AACAGCGAGA ACGGGTCGAA GCTCGCATTC TCGCAAGGCG CGATGGGGGT CCTGCTGAAG
TGCTACTTCC CGGGCAACGT GCGCGAGCTT GAGAACTGCG TCAGGGGCAC CGCGACGCTA
GCGGCCTCCG AGGAGCTGAT CATGTCGGAC GACTTCGCAT GCGTGAGCGG GCAATGCCTC
TCGGCGGTGC TCTGGAAGAG GAGCACGCCG AAGCCCTGGG ACGCCGCGGT GACGGCCGCC
ACACCCGTGC CGGTGAGCCC GGAGCGGCCG CCCACCCCGG CGGAGGCCGA GCCGCCCGCG
TCCTGCCCCG GCGCCAAGAC CCTCCCGGGC GTCCGGCCGC GCCCCACCAA GCAGCAGTTG
CTCGAGGCCC TAGAGCGCAC CGGCTATGTG CAGGCTAAGG CCGCCCGCCT CCTCGAGATC
ACGCCCCGCC AGCTCGGCTA TGCGGTGCGC AAGTACGACA TCCCCCTCAA AGAGTTCTGA
 
Protein sequence
MDYADLFPSR IETGLPQVEH VVPERLGQQP APPSPTPRLP VQARPASASR GTSAEVQLIG 
IYEISKALTA VRHLEVTLTK VIDILSSTLS MHHGMIVILD QEGGPEIVSS TGWTAQIAHH
LRACLPQRAI DQIVDTAAPL VVQDVSADPL FHGHLDLFED AGKAITSFIG VPIKADSRVL
GTLSIDRICD GSARFCSDED TRFLTMVANL IGQTVWLHNT LAQDRDRLIA KAHRLEKALA
ETSAISSSVG LVGESQELKR LAAKAEVAAR SNTTVLLRGE SGTGKELFAR AIHELSPRKS
KPFVRVNCAA LADSVLESEL FGHEKGAFTG AIATRHGRFE AANGGTLFLD EIGEVSATFQ
AKLLRVLQEG EFERVGGNRT IKVDVRLVFA TNRNLEEAVT KGDFRADLYY RINVVSLILP
PLRERRGDIP DLAKAFLTRY NSENGSKLAF SQGAMGVLLK CYFPGNVREL ENCVRGTATL
AASEELIMSD DFACVSGQCL SAVLWKRSTP KPWDAAVTAA TPVPVSPERP PTPAEAEPPA
SCPGAKTLPG VRPRPTKQQL LEALERTGYV QAKAARLLEI TPRQLGYAVR KYDIPLKEF