Gene MCA0764 details


Gene Information

Locus tag: MCA0764
Symbol: nifA
ID: 3102395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 803938
End bp: 805473
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 62%
IMG OID: 637169973
Product: transcriptional regulator NifA
Protein accession: YP_113267
Protein GI: 53804913
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR01817] Nif-specific regulatory protein
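
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length; both are assumptions about how this record is laid out, and the helper names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the record above.
START_BP = 803938
END_BP = 805473
GENE_LENGTH_BP = 1536
PROTEIN_LENGTH_AA = 511


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 805473 - 803938 + 1 gives 1536 bp, and 1536 / 3 - 1 gives 511 aa, matching the listed gene and protein lengths.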


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGATC GCTACACATT GATCGAGCGG GAGCTCGACG CCCTCTATCA GGTCGGCCAG 
GTGTTGAACA GCACGGCCGA TCTCAAGGGC AGGCTGGAGG GTGTGCTCAA CGTCCTGCAC
GAGCAGGCCG ATCTGCGTTC CGGGATGATC GCGCTGCGGG AGCCGGAAAC CGATGCGCTG
GTATTGGCGG TACTGCGGCG GGGCGAAGCC GGGCGGACTT CCAACGAGCC GGTGCGTTAC
GAGCCGGGCG AAGGACTGAT CGGCACCATT CTGGAAGCAA ACTGCACCAT TGTGGTGGAG
CGCATCGCTG ACGAGCCGCG TTTCCTCGGC CGCCTCGGTC TCTACGATCC GGAGTTGCCC
TTCATCGGCT CGCCCATCCA TGTGGGCAAG GATGAACTGC TCGGGGTGCT GGCGGCTCAG
CCCAGCGAGC GTGAGCTGCT GGGCGAGCGC GCCAGGTTCC TGGAGATGGT GAGCAATCTG
GTCGCGCAGA GCGTCGGCCT GCTGCGCGGG ATGGAACAGA AGCAGCGCGA TCTCACGACC
CAATGCGAGC AGCTGCAACA GACGCTGCGC TCGAATTACG GATTCGAGAA CATCATCGGC
CGGACGCCGC CGATGCTGCG TGTCTTCGAG ACCGTGCGCC AGGTCGCGAA ATGGAACACG
ACGGTGCTCA TCCGGGGTGA GTCGGGCACC GGCAAGGAGA TGATCGCCAG CGCGATCCAC
TTCAATTCGC CCCGGGCGAG TGGCCCCTTC GTGAAGCTCA ACTGTGCCGC CTTGCCTGAA
AACCTGCTGG AGTCGGAACT GTTCGGCCAC GAGAAAGGCG CCTTCAGCGG CGCCGTCAAC
CAGCGCAAGG GGCGGTTCGA GCTGGCCAAC CACGGTACCC TTTTTCTGGA TGAAATCGGC
GAGATTTCAC CGGCTTTCCA GGCCAAGCTA CTGCGGGTGC TGCAGGAGGG TGAATTCGAA
CGGGTCGGCG GCATCCACAC GCTCAAAGTC GATGTGAGGA TCATTGCGGC GACCAACCGG
GATCTCGAGT CTGCGGTGGA AGAAGGGGCA TTTCGTGAGG ACCTTTATTA TCGTCTCAAC
GTGATGCCCA TCCAGATGCC TCCTCTGCGG CAGCGCAAGG AGGACATTCC GGAGCTCGCC
CGCTTCCTGC TGGGCAGAAT CTCCCGCAAC CAGGGCGGCC GGCTTCTGGA AATCAAGGAG
AGCGCCATCC GTTCGCTGAT GCGGCACGAC TGGCCCGGCA ACGTGCGCGA ACTGGAAAAC
CTGCTCGAAC GCGCGGCGAT CATGAGCCGT GAAGGGGTCA TCGACAGTGA GGTGATCGGC
ATGACCGGGC TCAAGGAGAA GCTGACGACG GTGGCCGATG TGGCTTCCTA CAAAGTCGAC
CTGGGTGATG AGAACCTGGA TGAACGGGAG CGCATCATCG CCGCACTCGA ACAGAGCGGC
TGGGTGCAGG CGAAAGCCGC ACGCCTGCTC GACATGACGC CGAGACAGAT CGCCTACCGT
GTCAAAATCC TGAATATCCA CATGAAACAC CTGTAG
 
Protein sequence
MVDRYTLIER ELDALYQVGQ VLNSTADLKG RLEGVLNVLH EQADLRSGMI ALREPETDAL 
VLAVLRRGEA GRTSNEPVRY EPGEGLIGTI LEANCTIVVE RIADEPRFLG RLGLYDPELP
FIGSPIHVGK DELLGVLAAQ PSERELLGER ARFLEMVSNL VAQSVGLLRG MEQKQRDLTT
QCEQLQQTLR SNYGFENIIG RTPPMLRVFE TVRQVAKWNT TVLIRGESGT GKEMIASAIH
FNSPRASGPF VKLNCAALPE NLLESELFGH EKGAFSGAVN QRKGRFELAN HGTLFLDEIG
EISPAFQAKL LRVLQEGEFE RVGGIHTLKV DVRIIAATNR DLESAVEEGA FREDLYYRLN
VMPIQMPPLR QRKEDIPELA RFLLGRISRN QGGRLLEIKE SAIRSLMRHD WPGNVRELEN
LLERAAIMSR EGVIDSEVIG MTGLKEKLTT VADVASYKVD LGDENLDERE RIIAALEQSG
WVQAKAARLL DMTPRQIAYR VKILNIHMKH L
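
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not a reference implementation: it strips the block spacing, translates DNA with the standard sense-codon assignments (which translation table 11 shares with the standard code; the two differ in which codons may act as starts), and computes GC percentage. The function names `strip_blocks`, `translate`, and `gc_percent` are hypothetical. Because this gene begins with ATG, no special start-codon handling is included.

```python
# Codon table in NCBI "TCAG" order; sense codons match translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def strip_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = strip_blocks(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = strip_blocks(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and the last line of the gene sequence, copied from above.
first_blocks = "ATGGTTGATC GCTACACATT GATCGAGCGG"
last_line = "GTCAAAATCC TGAATATCCA CATGAAACAC CTGTAG"

print(translate(first_blocks))  # MVDRYTLIER
print(translate(last_line))     # VKILNIHMKHL
```

The two translated fragments agree with the first block and the final eleven residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would allow a comparison with the GC content field of the record.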