Gene PCC8801_1787 details

Gene Information

Locus tag: PCC8801_1787
Symbol:
ID: 7101849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1873451
End bp: 1874986
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 45%
IMG OID: 643474855
Product: nitrogenase molybdenum-iron protein beta chain
Protein accession: YP_002371989
Protein GI: 218246618
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01286] nitrogenase molybdenum-iron protein beta chain
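
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1873451, 1874986)
print(length)                  # 1536
print(protein_length(length))  # 511
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.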


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCAGA ACATTGATAA AATCCAAGAC CACGTTGAGT TATTCCACCA ACCAGAGTAC 
CAAGAGCTAT TTGAAAACAA GAAAGCTCTC CAAGGAATGG CTTCTGATGA GAAAGTCGCT
GAAATAGCCG AATGGACCAA AACCTGGGAA TATCGGGAAA AGAACTTCGC TCGTGAAGCT
CTGACCATCA ACCCCGCTAA AGCTTGTCAA CCTTTGGGTG CTATCTTAGC TGCGGTTGGT
TTTGAAGGAA CCCTCCCCTT TGTGCATGGA TCACAAGGTT GTGTGGCTTA CTTCCGTACC
CACTTTACCC GTCACTTCAA AGAGCCTTTC AGTGGTGTTT CTTCTTCCAT GACTGAAGAT
GCAGCCGTCT TCGGTGGACT GAAAAACATG ATCGAAGGGT TACAGAATGC TTATAGTCTC
TATCAACCCA AAATGATTGC TGTCTGTACA ACTTGTATGG CAGAAGTTAT TGGGGATGAC
TTAGGTTCCT TCATTGGCAA TGCTAAGGCT GACGGTTCTG TTCCTCAAGA TTTCCCCGTT
CCCTTTGCTC ACACTCCTTC TTTCGTGGGT TCTCATATCA CGGGATATGA CAACATGATG
AAAGCCATCC TGTTGAACTT AACCGACGGC AAGAAACCTA CCACCAGCAA CGGTAAAGTT
AACTTTATTC CTGGGTTTGA AACCTATGTT GGTAACCTAC GCGAACTGAA GCATTTAACC
AGTGCTATGG GGGTTGATGC TACCATTTTA GGAGACAACG AACTCTATTT AGATTCTCCT
AACGATGGCG AGTTCAAAAT GTACCAAGGT GGTACTACTC TAGAAGAAGG TGCTGATGCT
ATCAATGCAA CCAAAACCAT CGCACTGCAA ACCTATCCCA CCGTTAAAAC CCTCGAATAC
ATCGAGAAAG AATGGCAGCA ACCCACCGCT ACCTATCGTC CTTGGGGTAT TAAAGGAACG
GATGAGTTCG TCATGGCTTT ATCTGAACTC ACTGGGAATC CTGTTCCTCC CGAATTGGAA
CTAGAACGGG GACGCGCAGT GGATGCTATG ACCGATAGTC ATGCTTGGTT ACATGGTAAA
AAAGCGGCTA TCTATGGCGA CCCTGACTTA GTCATGGGAA TGCTGCAATT CATGTTAGAG
ATGGGTGTTG AACCTGTTCA CGTTTTGGTT CACAACTCTA CCACTGAATT TGAAGAAGAA
GCCAAAGCTC TCTTAGCTTC TAGTCCTTAT GGTCAAAAAG CCACCGTTTG GGGCGGTAAA
GACCTCTGGC ACCTCCGTTC CTTACTGTTT ACTGAACCTG TTGACTTCTT AATCGGGAAT
TCCTACGGTA AATACCTCTG GCGTGATACC AAGATTCCTT TAATCCGCAT CGGGTATCCT
ATCTTTGATC GCCACCACTT ACATCGCTAT TCTACCATTG GTTACAATGG CGCGATTAAC
CTGCTCAATT GGATTGTTAA TGGTCTGTTT GAAGAAATCG ACCGCAACAC CAATATCCCC
TCGAAGACCG ACATTTCCTT CGATTTAGTT CGTTAA
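
The gene sequence above is displayed in space-separated groups of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of that cleanup and of a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding convention behind the GC content value in the record is not stated in the source.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases for display."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 20 bases of the gene sequence above, in the same grouped layout.
fragment = clean_sequence("ATGTCTCAGA ACATTGATAA")
print(len(fragment))  # 20
print(gc_percent(fragment))
```

Applied to the full sequence, the cleaned length can be compared with the Gene Length field and the GC percentage with the GC content field.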
 
Protein sequence
MSQNIDKIQD HVELFHQPEY QELFENKKAL QGMASDEKVA EIAEWTKTWE YREKNFAREA 
LTINPAKACQ PLGAILAAVG FEGTLPFVHG SQGCVAYFRT HFTRHFKEPF SGVSSSMTED
AAVFGGLKNM IEGLQNAYSL YQPKMIAVCT TCMAEVIGDD LGSFIGNAKA DGSVPQDFPV
PFAHTPSFVG SHITGYDNMM KAILLNLTDG KKPTTSNGKV NFIPGFETYV GNLRELKHLT
SAMGVDATIL GDNELYLDSP NDGEFKMYQG GTTLEEGADA INATKTIALQ TYPTVKTLEY
IEKEWQQPTA TYRPWGIKGT DEFVMALSEL TGNPVPPELE LERGRAVDAM TDSHAWLHGK
KAAIYGDPDL VMGMLQFMLE MGVEPVHVLV HNSTTEFEEE AKALLASSPY GQKATVWGGK
DLWHLRSLLF TEPVDFLIGN SYGKYLWRDT KIPLIRIGYP IFDRHHLHRY STIGYNGAIN
LLNWIVNGLF EEIDRNTNIP SKTDISFDLV R