Gene PCC8801_1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1789 
Symbol 
ID7104994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1876772 
End bp1878157 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content45% 
IMG OID643474857 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifN 
Protein accessionYP_002371991 
Protein GI218246620 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTATTG TTCTTAATCC GAAAAAACCG TTATCGGTGA ATCCATTAAA AATGAGTCAA 
CCTTTGGGGG CTTCCTTGGC CTTTTTGGGG TTAAAAGGGA TGATGCCCTT ATTTCATGGG
GCTCAAGGCT GTACCGCCTT TGCTAAAGTG GTTCTGGTGC GTCATTTCCG CGAATCTATT
CCCCTGTCTA CCACGGCGAT GACGGAGGTT AGTACCATTT TAGGGGGTCA AGATCACGTT
GAACAGGCCA TTTTAACTAT CGTTGACAAG AACAAACCCG AAATCATCGG ACTGCTGACC
ACTGGGTTAA CGGAAACCCG TGGGGATGAT ATGGAGGGTA TTCTTAAGGA TATCCGTCAA
AAGCACCCAC AATTAAAGAA TTTACCGATT GTTTTTGTCT CTACGCCCGA TTATAAGGGG
TCACTACAGG ATGGCTACGC AGCCACGGTA GAGCAAATCG TTGCGACCGA TTATAATGCC
TTTATCGCCG AAAATGCCCG AAGTGCGGTC ATTTATCCCC AACCCCAGGT GACGGTTTTG
GCGGGTTCTT CCCTATCTCC TGGGGATATC CAAGAAATTA AGTCCATTAT TGAAGCTTTT
GGCTTAATGC CCCTGGTTAT TCCCGATTTA TCGAGATCTC TCGATGGTCA TCTAGAGGAT
GGCTATCAGT CCATAACGGG AGGAGGAACG ACCCTGCCCC AGTTGCGATC GCTGCCCCAT
TCCTGTTATA CCCTAGCTAT TGGGGAAAGT ATGCGCGGGG CAGCAGAAAT CTTAAAAGAC
CGTTTTGGAA CGAATTATGA AGTATTCCCC CGTTTAGCAG GATTAGAGGC GGTAGATACT
TTTTTATGGC GATTATCGCA GATTGTTACC TCTCGCTGCG ATCATCATTT CCCCATTGTT
CCTAATATTC CGGCTTTATT TGAACGCCAA CGCCGCCAGT TACAAGATGC TATTCTTGAC
ACCCATTTCT ATTTTGGGGG TAAAAAAGTT GCCCTCGCAT TAGAACCCGA TTTACTCCAT
CAAACGGCTT GGTTATTGAC AGAAATGGGT GCAAAAATTC AGGCGGCTGT TACCACTACT
AAGTCACCTT TATTGGAAGA TTTGCCTGTT GATACTGTGA CTATTGGCGA CTTAGAAGAT
TTAGAAGATT TGTCCGCAGG GGTCGATTTA ATTATTACCA ATTCCCACGG CACAGCAATG
GCACAACGGT TAAATGCGCC CTTGTATCGT ATGGGTTATC CGGTGTTTGA TCAGTTAGGA
AATGGTCAAC GCTGTTTAGT TGGATATCGT GGAACAATAC AATTTTTGTT TGATGTTGGC
AATATTTTAT TAGCCGAAGA AGCAAACCAC AATCATCAAT TATCGGTCGG GGTTCATGTT
ACCTAA
 
Protein sequence
MTIVLNPKKP LSVNPLKMSQ PLGASLAFLG LKGMMPLFHG AQGCTAFAKV VLVRHFRESI 
PLSTTAMTEV STILGGQDHV EQAILTIVDK NKPEIIGLLT TGLTETRGDD MEGILKDIRQ
KHPQLKNLPI VFVSTPDYKG SLQDGYAATV EQIVATDYNA FIAENARSAV IYPQPQVTVL
AGSSLSPGDI QEIKSIIEAF GLMPLVIPDL SRSLDGHLED GYQSITGGGT TLPQLRSLPH
SCYTLAIGES MRGAAEILKD RFGTNYEVFP RLAGLEAVDT FLWRLSQIVT SRCDHHFPIV
PNIPALFERQ RRQLQDAILD THFYFGGKKV ALALEPDLLH QTAWLLTEMG AKIQAAVTTT
KSPLLEDLPV DTVTIGDLED LEDLSAGVDL IITNSHGTAM AQRLNAPLYR MGYPVFDQLG
NGQRCLVGYR GTIQFLFDVG NILLAEEANH NHQLSVGVHV T