Gene Cyan8802_1814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1814 
Symbol 
ID8391128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1846580 
End bp1848010 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content45% 
IMG OID644979801 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_003137548 
Protein GI257059660 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACAG TAGAAGACAG AAAGCAGCTT ATCCAAGACG TTCTTGATAC CTATCCTGAG 
AAGTTAGCCA AGAAACGGTC TAAACACCTC AATGTTTACG AAGAAGGCAA AGACGATTGT
GGAGTAAAAT CTAACATTAA GTCTGCACCT GGTGTAATGA CCGCTCGTGG TTGTGCTTAT
GCAGGATCTA AAGGGGTGGT TTGGGGTCCT ATCAAAGATA TGATCCATAT CTCCCACGGA
CCTGTTGGTT GCGGTTACTA CTCTTGGTCT GGTCGTCGTA ACTATTACAT CGGAACCACT
GGGGTTGATA CCTTTGGTAC GATGAACTTT ACCTCTGACT TCCAAGAAAA AGACATCGTT
TTTGGTGGAG ACAAAAAACT CCTCAAAATC ACCGAAGAAA TCGAAGAATT ATTCCCCCTC
AACAATGGGA TTTCCATTCA GTCTGAATGT CCTGTTGGAT TAATTGGGGA TGACATCGAA
GGTGTTGCCA AAAAAGCGCA AAAAATTACT GGCAAACCCG TTATTCCCGT CCGTTGTGAA
GGATTCCGTG GCGTTTCCCA ATCCTTAGGA CACCACATCG CTAACGACGC AGTGCGTGAC
TGGGTATTTA GCCGTGATGA TGCTCAAGAA ATCGAAACCA CTCCCTATGA TGTTGCCATC
ATTGGAGACT ACAACATCGG TGGAGATGCT TGGTCTAGCC GTATTCTTCT TGAAGAAATG
GGTCTGCGCG TCGTTGCTCA ATGGTCTGGA GACGGAACCA TCAACGAAAT GATGCAAACC
CCCAAAGTGA AACTCAACCT GATTCACTGT TACCGTTCCA TGAACTACAT CAGTCGTCAC
ATGGAAGAAA AATACGGTAT TCCCTGGTTT GAGTACAACT TCTTTGGTCC TACCAAGATT
GCTGAATCCT TACGCGCGAT CGCTGCTCTG TTTGATGACA CCATCAAAGA AAATGCAGAG
AAAGTCATTG CTAAGTACGA ACAACAAACC GCAGAAGTCT TAGCCAAATA CCGTCCTCGT
TTGGAAAACA AAACCGTCAT GATGATGGTG GGTGGACTGC GTCCTCGTCA CGTTGTTCCT
GCTTTCACAG ACTTAGGCAT GAAAATGATC GGAACCGGAT ATGAGTTCGC TCACGGTGAC
GACTATAAAC GTACCACTGA GTATGTTGAT GATGCAACCC TCATCTATGA TGACGTAACT
GCTTATGAGT TCGAGAAATT CGTTCAAGAA CTCAAACCCG ACTTAGTCGC TTCTGGCGTT
AAAGAGAAGT ATGTCTTCCA GAAAATGGGA CTACCTTTCC GTCAAATGCA CTCTTGGGAT
TACTCTGGTC CTTACCACGG TTATGATGGG TTCGCTATCT TTGCACGGGA TATGGACTTA
GCTCTCAATA ACCCGACCTG GGGATTAATC AAATCTCCTT GGAATAAGTA A
 
Protein sequence
MSTVEDRKQL IQDVLDTYPE KLAKKRSKHL NVYEEGKDDC GVKSNIKSAP GVMTARGCAY 
AGSKGVVWGP IKDMIHISHG PVGCGYYSWS GRRNYYIGTT GVDTFGTMNF TSDFQEKDIV
FGGDKKLLKI TEEIEELFPL NNGISIQSEC PVGLIGDDIE GVAKKAQKIT GKPVIPVRCE
GFRGVSQSLG HHIANDAVRD WVFSRDDAQE IETTPYDVAI IGDYNIGGDA WSSRILLEEM
GLRVVAQWSG DGTINEMMQT PKVKLNLIHC YRSMNYISRH MEEKYGIPWF EYNFFGPTKI
AESLRAIAAL FDDTIKENAE KVIAKYEQQT AEVLAKYRPR LENKTVMMMV GGLRPRHVVP
AFTDLGMKMI GTGYEFAHGD DYKRTTEYVD DATLIYDDVT AYEFEKFVQE LKPDLVASGV
KEKYVFQKMG LPFRQMHSWD YSGPYHGYDG FAIFARDMDL ALNNPTWGLI KSPWNK