Gene PCC8801_1788 details

Gene Information

Locus tag: PCC8801_1788
Symbol:
ID: 7101850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1875290
End bp: 1876702
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 45%
IMG OID: 643474856
Product: nitrogenase MoFe cofactor biosynthesis protein NifE
Protein accession: YP_002371990
Protein GI: 218246619
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
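
The start, end, and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon; both assumptions are consistent with the values listed here but are not stated by the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1875290, 1876702)
print(length_bp)                  # 1413
print(protein_length(length_bp))  # 470
```

Both results agree with the Gene Length and Protein Length fields of this record.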


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAA CTAAAGGCAA AATCAACGAA TTACTAACAC AACCAGGCTG CGAACATAAT 
CATAATAAGG AAGGACAAGG GAAAAACAAA TCTTGTACCC AACAGGCTCA ACCTGGCTCA
GCACAAGGGG GATGCGCTTT TGATGGGGCT TCTATTGCTC TGGTTCCGAT TACCGATGCT
GCCCATTTAG TCCACGGTTC AATCGCCTGT TCTGGTAATA GTTGGAACAG TCGGGGCAGT
CTGAGCAGTG GTCCGATGAC TTATAAAATG GGTTTTACAA CAGATTTATC AGAAAATGAT
GTTATTTTTG GTGGTGAAAA AAAGCTTTAT CAAGCGATCG CTCAATTAGT AAAACGCTAC
CATCCGGCGG CGGTTTTTGT CTATTCGACC TGTGTTACCG CGTTAATTGG AGATGATCTT
GATGCGGTGT GTAAAGCAGC CACAAAAAAA TATGAAACGC CGATTATTCC CGTTCATGCC
CCTGGATTTG TTGGTAGTAA AAACCTAGGA AACCGTCTCG GTGGTGAAGC ACTTCTTGAT
CATGTTGTGG GAACCCGTGA GCCAGAATTT ACCACGGATT TTGATATTAA TTTGATTGGA
GAATACAACG TCGCTGGAGA AATGTGGGGC GTTTTACCTC TGTTTGAAAA GTTAGGTATT
CGGGTGTTAG CTAAGATTAC GGGAGATGCC CGTTACGAAG AAGTTTGTTA TGCCCATCGT
GCTAAACTTA ATTTAATGAT CTGCTCTAAG GCTCTGATTA ACATGGCCAC AGCAATGCAA
GAGCGTTATG GTATTCCCTA CATTGAAGAG TCTTTCTATG GCATTGCAGA CATGAACCGT
TGTTTACGGA ACATCGCTGA GTATTTCGGA GATGCCGCTT TAAAAGAACG GGTAGAACAG
TTAATAGAAG AAGAAACCAC GAAATTAGAC CTAGCCTTAG CCCCCTACCG GGAACGTCTC
AAGGGTAAGC GCGTTGTCCT CTACACGGGA GGGGTCAAGA GTTGGTCGGT GGTGTCCGCA
GCGCAAGATT TAGGCATGGA AGTGGTGGCC ACCAGCACCA AGAAGAGTAC GGAAGAAGAT
AAAGCGAAGA TTCGAGAATT ATTAGGCAAA GATGGAATTA TGCTCGAAAA AGGCAGCCCG
ACGGAATTAT TGCGGGTTGT GGAGCAAACC AAGGCAGATT TATTAGTCGC AGGGGGTCGT
AATCAGTATA CCGCCCTCAA GGCTAGGATT CCTTTTTTGG ATATTAACCA AGAACGTCAC
CATCCCTACG CGGGATATGT TGGGATGATT GAGATGGCGC GAGAATTGGA CGAAGCCGTT
CATAGTCCTA TCTGGCGGTT AGTTCGTCAA CCTTCCCCTT GGGATATTTG GCAACAGGAA
CACGAAAGTT TATTGAATTT AGAAGCGGAA TAA
 
Protein sequence
MKLTKGKINE LLTQPGCEHN HNKEGQGKNK SCTQQAQPGS AQGGCAFDGA SIALVPITDA 
AHLVHGSIAC SGNSWNSRGS LSSGPMTYKM GFTTDLSEND VIFGGEKKLY QAIAQLVKRY
HPAAVFVYST CVTALIGDDL DAVCKAATKK YETPIIPVHA PGFVGSKNLG NRLGGEALLD
HVVGTREPEF TTDFDINLIG EYNVAGEMWG VLPLFEKLGI RVLAKITGDA RYEEVCYAHR
AKLNLMICSK ALINMATAMQ ERYGIPYIEE SFYGIADMNR CLRNIAEYFG DAALKERVEQ
LIEEETTKLD LALAPYRERL KGKRVVLYTG GVKSWSVVSA AQDLGMEVVA TSTKKSTEED
KAKIRELLGK DGIMLEKGSP TELLRVVEQT KADLLVAGGR NQYTALKARI PFLDINQERH
HPYAGYVGMI EMARELDEAV HSPIWRLVRQ PSPWDIWQQE HESLLNLEAE
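
The gene and protein sequences above can be compared with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the space-separated 10-character blocks are purely cosmetic, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in which start codons it allows). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as listed in this record.
first_line = "ATGAAATTAA CTAAAGGCAA AATCAACGAA TTACTAACAC AACCAGGCTG CGAACATAAT"
print(translate(first_line))  # MKLTKGKINELLTQPGCEHN
```

The translated residues match the first 20 residues of the protein sequence shown above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.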