Gene Cyan8802_1816 details

Gene Information

Locus tag: Cyan8802_1816
Symbol:
ID: 8391130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 1849921
End bp: 1851333
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 45%
IMG OID: 644979803
Product: nitrogenase MoFe cofactor biosynthesis protein NifE
Protein accession: YP_003137550
Protein GI: 257059662
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
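
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1849921
end_bp = 1851333
gene_length_bp = 1413
protein_length_aa = 470

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # 1413 471 470
```

Under these assumptions the listed gene length (1413 bp) and protein length (470 aa) agree with the coordinates.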


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.615494
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAA CTAAAGGCAA AATCAACGAA CTGCTAACAC AACCAGGCTG CGAACATAAT 
CATAATAAGG AAGGACAAGG GAAAAACAAA TCTTGTACTC AACAGGCTCA ACCTGGCTCA
GCACAAGGGG GATGCGCTTT TGATGGGGCT TCTATTGCTC TGGTTCCGAT TACCGATGCT
GCCCATTTAG TCCACGGTTC TATCGCCTGT TCTGGTAATA GTTGGAACAG TCGAGGCAGT
CTGAGCAGTG GTCCGATGAC TTATAAAATG GGTTTTACAA CAGATTTATC AGAAAATGAT
GTTATTTTTG GTGGTGAAAA AAAGCTTTAT CAAGCGATCG CTCAATTAGT AAAACGCTAC
CATCCGGCGG CGGTTTTTGT CTATTCGACC TGTGTTACCG CGTTAATTGG AGATGATCTT
GATGCGGTGT GTAAAGCAGC CACAAAAAAA TATGAAACGC CGATTATTCC CGTTCATGCC
CCTGGATTTG TTGGTAGTAA AAACCTAGGA AACCGTCTCG GTGGTGAAGC ACTTCTTGAT
CATGTTGTGG GAACCCGTGA GCCAGAATTT ACCACGGATT TTGATATTAA TTTGATTGGA
GAATACAACG TCGCTGGAGA AATGTGGGGC GTTTTACCTC TATTTGAAAA GTTAGGTATT
CGGGTGTTAG CTAAGATTAC GGGAGATGCC CGTTACGAAG AAGTTTGTTA TGCCCATCGT
GCTAAACTTA ATTTAATGAT CTGCTCTAAG GCTCTGATTA ACATGGCCAC AGCAATGCAA
GAGCGTTATG GTATTCCCTA CATTGAAGAG TCTTTCTATG GCATTGCAGA CATGAACCGT
TGTTTACGGA ACATCGCTGA GTATTTCGGA GATGCCGCTT TAAAAGAACG GGTAGAACAG
TTAATAGAAG AAGAAACCAC GAAATTAGAC CTAGCCTTAG CCCCCTACCG GGAACGTCTC
AAGGGTAAGC GCGTTGTCCT CTACACCGGA GGGGTTAAGA GTTGGTCGGT GGTGTCCGCA
GCGCAAGATT TAGGCATGGA AGTGGTGGCC ACCAGCACCA AGAAGAGTAC GGAAGAAGAT
AAAGCGAAGA TTCGAGAATT ATTAGGCAAA GATGGAATTA TGCTCGAAAA AGGCAGCCCG
ACGGAATTAT TGCGGGTTGT GGAGCAAACC AAGGCAGATT TATTAGTCGC AGGGGGTCGT
AATCAGTATA CCGCCCTTAA GGCTAGGATT CCTTTTTTGG ATATTAACCA AGAACGTCAC
CATCCCTACG CGGGATATGT TGGGATGATT GAAATGGCGC GGGAATTGGA CGAAGCCGTT
CATAGTCCTA TCTGGCGGTT AGTTCGTCAA CCTTCCCCTT GGGATATTTG GCAACAGGAA
CACGAAAGTT TATTGAATTT AGAAGCGGAA TAA
 
Protein sequence
MKLTKGKINE LLTQPGCEHN HNKEGQGKNK SCTQQAQPGS AQGGCAFDGA SIALVPITDA 
AHLVHGSIAC SGNSWNSRGS LSSGPMTYKM GFTTDLSEND VIFGGEKKLY QAIAQLVKRY
HPAAVFVYST CVTALIGDDL DAVCKAATKK YETPIIPVHA PGFVGSKNLG NRLGGEALLD
HVVGTREPEF TTDFDINLIG EYNVAGEMWG VLPLFEKLGI RVLAKITGDA RYEEVCYAHR
AKLNLMICSK ALINMATAMQ ERYGIPYIEE SFYGIADMNR CLRNIAEYFG DAALKERVEQ
LIEEETTKLD LALAPYRERL KGKRVVLYTG GVKSWSVVSA AQDLGMEVVA TSTKKSTEED
KAKIRELLGK DGIMLEKGSP TELLRVVEQT KADLLVAGGR NQYTALKARI PFLDINQERH
HPYAGYVGMI EMARELDEAV HSPIWRLVRQ PSPWDIWQQE HESLLNLEAE
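
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustration, not the database's own procedure. It assumes the sequences are pasted in with the 10-character spacing shown above, and it uses the codon assignments shared by the standard genetic code and translation table 11 (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGAAATTAA CTAAAGGCAA AATCAACGAA"
print(translate(first_block))  # MKLTKGKINE
```

The ten residues produced match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.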