Gene PCC8801_1786 details

Gene Information

Locus tag: PCC8801_1786
Symbol:
ID: 7105556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1871844
End bp: 1873274
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 45%
IMG OID: 643474854
Product: nitrogenase molybdenum-iron protein alpha chain
Protein accession: YP_002371988
Protein GI: 218246617
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
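The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1871844, 1873274)
print(length)                  # 1431
print(protein_length(length))  # 476
```

Under these assumptions the listed gene length (1431 bp) and protein length (476 aa) agree with the listed coordinates.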


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAACAG TAGAAGACAG AAAGCAGCTT ATCCAAGACG TTCTTGATAC CTATCCTGAG 
AAGTTAGCCA AGAAACGGTC TAAACACCTC AATGTTTACG AAGAAGGCAA AGACGATTGT
GGAGTAAAAT CTAACATTAA GTCTGCACCT GGTGTAATGA CCGCTCGTGG TTGTGCTTAT
GCAGGATCTA AAGGGGTGGT TTGGGGTCCT ATCAAAGATA TGATCCATAT CTCCCACGGA
CCTGTTGGTT GCGGTTACTA CTCTTGGTCT GGTCGTCGTA ACTATTACAT CGGAACCACT
GGGGTTGATA CCTTTGGTAC GATGAACTTT ACCTCTGACT TCCAAGAAAA AGACATCGTT
TTTGGTGGAG ACAAAAAACT CCTCAAAATC ACCGAAGAAA TCGAAGAATT ATTCCCCCTC
AACAATGGGA TTTCCATTCA GTCTGAATGT CCTGTTGGAT TAATTGGGGA TGACATCGAA
GGTGTTGCCA AAAAAGCGCA AAAAATTACT GGCAAACCCG TTATTCCCGT CCGTTGTGAA
GGATTCCGTG GCGTTTCCCA ATCCTTAGGA CACCACATCG CTAACGACGC AGTGCGTGAC
TGGGTATTTA GCCGTGATGA TGCTCAAGAA ATCGAAACCA CTCCCTATGA TGTTGCCATC
ATTGGAGACT ACAACATCGG TGGAGATGCT TGGTCTAGCC GTATTCTTCT CGAAGAAATG
GGTCTGCGCG TCGTTGCTCA ATGGTCTGGA GACGGAACCA TCAACGAAAT GATGCAAACC
CCCAAAGTGA AACTCAACCT GATTCACTGT TACCGTTCCA TGAACTACAT CAGTCGTCAC
ATGGAAGAAA AATACGGTAT TCCCTGGTTT GAGTACAACT TCTTTGGTCC TACCAAGATT
GCTGAATCCT TACGCGCGAT CGCTGCTCTG TTTGATGACA CCATCAAAGA AAATGCAGAG
AAAGTAATTG CTAAGTACGA ACAACAAACC GCAGAAGTCT TAGCCAAATA CCGTCCTCGT
TTGGAAAACA AAACCGTCAT GATGATGGTG GGTGGACTAC GTCCTCGTCA CGTTGTTCCT
GCTTTCACAG ACTTAGGCAT GAAAATGATC GGAACCGGAT ATGAGTTCGC TCACGGTGAC
GACTATAAAC GTACCACTGA GTATGTTGAT GATGCAACCC TCATCTATGA TGACGTAACT
GCCTACGAGT TCGAGAAATT CGTTCAAGAA CTGAAACCCG ACTTAGTTGC TTCTGGCGTT
AAAGAGAAGT ATGTCTTCCA GAAAATGGGA CTACCTTTCC GTCAAATGCA CTCTTGGGAT
TACTCTGGTC CTTACCACGG TTATGATGGG TTCGCTATCT TTGCACGGGA TATGGACTTA
GCTCTCAATA ACCCGACCTG GGGATTAATC AAATCTCCTT GGAATAAGTA A
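The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below shows one way to strip the whitespace and compute the GC fraction, which can then be compared with the GC content listed in the record. The helper names are hypothetical, and only the first ten-base block is used as a demonstration input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First ten-base block of the gene sequence: 4 of 10 bases are G or C.
print(gc_content("ATGTCAACAG"))  # 0.4
```

Passing the full block-formatted sequence to `gc_content` gives the whole-gene value.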
 
Protein sequence
MSTVEDRKQL IQDVLDTYPE KLAKKRSKHL NVYEEGKDDC GVKSNIKSAP GVMTARGCAY 
AGSKGVVWGP IKDMIHISHG PVGCGYYSWS GRRNYYIGTT GVDTFGTMNF TSDFQEKDIV
FGGDKKLLKI TEEIEELFPL NNGISIQSEC PVGLIGDDIE GVAKKAQKIT GKPVIPVRCE
GFRGVSQSLG HHIANDAVRD WVFSRDDAQE IETTPYDVAI IGDYNIGGDA WSSRILLEEM
GLRVVAQWSG DGTINEMMQT PKVKLNLIHC YRSMNYISRH MEEKYGIPWF EYNFFGPTKI
AESLRAIAAL FDDTIKENAE KVIAKYEQQT AEVLAKYRPR LENKTVMMMV GGLRPRHVVP
AFTDLGMKMI GTGYEFAHGD DYKRTTEYVD DATLIYDDVT AYEFEKFVQE LKPDLVASGV
KEKYVFQKMG LPFRQMHSWD YSGPYHGYDG FAIFARDMDL ALNNPTWGLI KSPWNK
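The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch with no external libraries, assuming the standard codon assignments; translation table 11 uses the same assignments as the standard table and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence.
print(translate("ATGTCAACAG TAGAAGACAG AAAGCAGCTT"))  # MSTVEDRKQL
print(translate("TGGAATAAGTAA"))                      # WNK
```

The two outputs match the first ten and last three residues of the protein sequence; the same function can be applied to the full gene sequence.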