Gene PCC8801_1780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1780 
Symbol 
ID7105554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1866002 
End bp1867477 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content46% 
IMG OID643474848 
Productnitrogenase cofactor biosynthesis protein NifB 
Protein accessionYP_002371982 
Protein GI218246611 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR01290] nitrogenase cofactor biosynthesis protein NifB 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACAAT CCACCGGTCT AATTAACTCT GACTCATCGG CGGCCAAGCA AGCAAAACCT 
TGTGGCGGTT CCTCGGCTCA TAAGTCCCCT GGAGCCTCTG GTTGCAGTTC TTCCAATACC
CAAGCGATCG CACCTGATAT TCAGGAACGC ATTGCCAAAC ACCCTTGTTA TAGCGAAGAT
GCCCATCACC ACTATGCCCG CTTGCACGTT GCGGTTGCTC CTGCCTGTAA TATTCAATGC
AACTATTGCA ATCGCAAGTA TGACTGTGCC AATGAAAGCC GTCCTGGGGT CGTCAGCGAA
GTTCTGACTC CCGAAGAAGC CGCCCATAAA GCCTTAGTCA TTGCCGGAAA GATTCCCCAA
ATGACCGTTT TAGGAATTGC AGGGCCTGGA GATCCTTTAG CCAACCCAGA GCAAACTTTT
CGTACCTTTG AATTGGTGGC GGAGAAAGCT CCTGATATTA AACTCTGTTT GTCGAGTAAT
GGCTTAATGT TACCCGATTA TATTGATCGC ATCAAAGAAC TCAAAATTGA TCATGTCACC
TTAACCATCA ACATGATTGA CCCTGAAATT GGGGAAAAAA TCTATCCTTG GGTGCGCTTC
AACCGCAAAC GCTACAAAGG CCTTGAAGGG GTAAAAATCC TCCATGAACG GCAAATGGAA
AGCCTCGATG CCCTCAGAGA AGCGAATATC CTTTGCAAAG TTAACTCTGT TATGATTCCA
GGGATCAACG ATCATCACCT GGCCGAAGTT AACGAACTGA TTCGTTCTAA GGGTGCATTC
CTCCACAATA TTATGCCCCT GATTAGTGCC CCCGAACATG GAACCCATTT TGGACTAACG
GGACAACGCG GACCGACTCC TAAAGAATTA AAGACTGTTC AAGACAGTTG TTCGGGCAAC
ATGAAAATGA TGCGTCACTG TCGTCAATGT CGCGCTGATG CCGTGGGACT GTTAGGAGAA
GATCGTTCTC AGGAATTTAC TAAGGACAAA TTCCTCGAAA TGACCCCCGA ATACGACTTA
GCCAAACGGC AGGAAGTCCA CGAAGACATT GAGAAATTTA CCGCCGAAGT CAAGGCTGCT
AAAGCACAAG TGGCTGCTAG TAAGAAAGCA AGCGGAACCA AGATTTTAGT CGCGGTTGCT
ACGAAAGGAA ACCGTCTGGT TAACCAACAT TTCGGCCATG CTAAGGAATT TCAAATCTTT
GAAGTCGATG GCGTAGAGGT CAAATTTGTT GCCCACCGAA AAGTTGATCA TTATTGTCAA
AGTGGATACG GCGAAGAAGC CTCATTAGAG CATATTATTA AAGCAATAGG AGATTGCAAA
GGAGTGCTCG CCTCTAAGAT TGGTAGTTGC CCCCAAACCG AATTACGCAA AGCTGGTATA
GAACCCTTTG AAGCTTACGA TGTCATCGAT AAAGTGGCTT TGGATTTCTA CGAGCAATAT
GTGCAGTCAA CCCCCTTGGT AGGAGTGTCC TCATGA
 
Protein sequence
MLQSTGLINS DSSAAKQAKP CGGSSAHKSP GASGCSSSNT QAIAPDIQER IAKHPCYSED 
AHHHYARLHV AVAPACNIQC NYCNRKYDCA NESRPGVVSE VLTPEEAAHK ALVIAGKIPQ
MTVLGIAGPG DPLANPEQTF RTFELVAEKA PDIKLCLSSN GLMLPDYIDR IKELKIDHVT
LTINMIDPEI GEKIYPWVRF NRKRYKGLEG VKILHERQME SLDALREANI LCKVNSVMIP
GINDHHLAEV NELIRSKGAF LHNIMPLISA PEHGTHFGLT GQRGPTPKEL KTVQDSCSGN
MKMMRHCRQC RADAVGLLGE DRSQEFTKDK FLEMTPEYDL AKRQEVHEDI EKFTAEVKAA
KAQVAASKKA SGTKILVAVA TKGNRLVNQH FGHAKEFQIF EVDGVEVKFV AHRKVDHYCQ
SGYGEEASLE HIIKAIGDCK GVLASKIGSC PQTELRKAGI EPFEAYDVID KVALDFYEQY
VQSTPLVGVS S