Gene PCC8801_1784 details


Gene Information

Locus tag: PCC8801_1784
Symbol:
ID: 7101847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1869530
End bp: 1870414
Gene Length: 885 bp
Protein Length: 294 aa
Translation table: 11
GC content: 42%
IMG OID: 643474852
Product: Fe-S cluster assembly protein NifU
Protein accession: YP_002371986
Protein GI: 218246615
COG category: [C] Energy production and conversion
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0694] Thioredoxin-like proteins and domains
        [COG0822] NifU homolog involved in Fe-S cluster formation
TIGRFAM ID: [TIGR02000] Fe-S cluster assembly protein NifU
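
The length fields in this record can be cross-checked from the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1869530
END_BP = 1870414


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: codon count minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 885 294
```

Under those assumptions, the computed values agree with the reported gene length (885 bp) and protein length (294 aa).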


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Fosmid covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGGAAT ATACAGATAA GGTAATGGAG TTCTTCTACA ACCCGCGTAA TCAAGGAACG 
ATTACAGAAA AGCAAGAAGG ACAAGCCATT ACAACTGGAG AAGTCGGAAG CATTGCCTGT
GGTGATGCCC TAAGATTACA CCTCAAAATT GATGAAGCGA CTCAAATTAT TCTTGATGCT
AGATTTCAAA CTTTTGGTTG TGCCTCTGCC ATTGCTTCCT CCTCCGCCTT GACGGAATTA
TTAGTTGGAA AAACCCTCGA CGAAGCCCTC AGTTTAACCA ATAGAGAAAT TGCTGAATTT
TTAGGGGGTT TACCCGAAGA GAAAATGCAC TGTTCTGTGA TGGGACAAGA AGCCTTAGAA
GCTGCTATTT TTAATTACCG AGGCATTCCT TTAGACCACC ATGAAGATGA CGAAGGAGCC
CTGATTTGCA AATGTTTTGG AGTAACTGAT GCGAGGATTC GTCGTGTTAT TATCGAAAAT
GATCTGACCA CAGCCGAACA AGTTACCAAC TATGTTAAAG CCGGTGGAGG ATGTAGTTCT
TGTCTGTCTG ATATCGATGA TATTTTAGCC GATATTACTC AAGAAAAAGC CACCGCCGTC
ACAGCAGCCA CCGAAGTTGT TCAAAGTAAA TTAACTCCCC AAAAACCCCT AAATAACTTA
CAAAAAATCA CCCTCATTCA ACAAATTCTC GACGAAGAAA TTAAACCAGC CTTGGCAAAA
GATGGAGGAG ATGTAGAGTT ATTTGATGTC GAAGGAGATT TGGTCAAAGT GATATTACAA
GGAGCCTGTG GTTCCTGTGC CAGCAGTACC CAAACCTTAA AAATGGGAAT CGAAGCCAGA
TTACGAGAGC GTGTTTCTCC TGAGTTAACG GTTATTTCTG TGTAA
 
Protein sequence
MWEYTDKVME FFYNPRNQGT ITEKQEGQAI TTGEVGSIAC GDALRLHLKI DEATQIILDA 
RFQTFGCASA IASSSALTEL LVGKTLDEAL SLTNREIAEF LGGLPEEKMH CSVMGQEALE
AAIFNYRGIP LDHHEDDEGA LICKCFGVTD ARIRRVIIEN DLTTAEQVTN YVKAGGGCSS
CLSDIDDILA DITQEKATAV TAATEVVQSK LTPQKPLNNL QKITLIQQIL DEEIKPALAK
DGGDVELFDV EGDLVKVILQ GACGSCASST QTLKMGIEAR LRERVSPELT VISV
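
The relationship between the gene and protein sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the tool used to generate the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1 for internal codons. Alternative start codons are not handled, which is safe here only because the gene begins with ATG.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
prefix = "ATGTGGGAAT ATACAGATAA GGTAATGGAG"
print(translate(prefix))  # prints: MWEYTDKVME
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.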