Gene PCC7424_2119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_2119 
Symbol 
ID7108848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011729 
Strand
Start bp2370592 
End bp2372022 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content45% 
IMG OID643480376 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_002377414 
Protein GI218439085 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.0308502 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACGG TAGATGACAG AAAGCAGTTA ATTCAAGATG TTCTTGATGA TTATCCTGAG 
AAGCTAGCCA AAAAACGGGC GAAACACCTC AATGTTTATG AAGAAGGCAA AGCAGATTGT
GGCGTAAAAT CTAACGTTAA GTCTGCGCCT GGAGTAATGA CTGCCCGTGG TTGTGCTTAT
GCAGGATCTA AAGGGGTGGT TTGGGGTCCG ATTAAGGATA TGATCCATAT CTCTCATGGT
CCGATTGGTT GTGGATATTA TTCTTGGTCT GGTCGTCGTA ACTACTACAT CGGAACCACT
GGGGTTGATA CCTTTGGAAC GATGAATTTT ACCTCCGATT TCCAAGAACG CGACATCGTC
TTTGGGGGAG ACAAAAAACT CCTGAAGATT ATGCAGGAAA TCGAGACACT GTTCCCCCTC
AACAATGGGG TTTCTGTTCA GTCTGAATGT CCCATCGGAT TGATTGGGGA TGACATTGAG
GCTGTAGCCC GTAAAGCGAG TAAAGAATCG GGTAAGCCGG TTGTTCCCGT CCGTTGTGAA
GGATTCAGAG GTGTTTCTCA GTCTTTGGGT CACCACATTG CTAATGATGC GGTTCGGGAT
TGGGTTTTCA GTCGCACTGA TGCCCCTGAA ATAGAAACCA CTCCTTATGA TGTGGCGATT
ATTGGAGACT ACAATATCGG TGGTGATGCT TGGTCTTCTC GCATTCTCTT AGAAGAAATT
GGTCTGCGCG TTGTGGCTCA ATGGTCTGGG GATGGTACGA TCAATGAGAT GATGCAAACT
CCCAAGGTTA AACTCAATCT GATCCACTGT TACCGCTCGA TGAACTACAT CAGCCGTCAC
ATGGAGGAAA AATATGGTAT TCCTTGGTTT GAGTATAATT TCTTCGGGCC GACTAAGATT
GCTGAGTCTT TAAGAGCGAT CGCTGCTTTA TTCGACGATA CGATTAAAGA AAATGCAGAA
CGAGTGATTG CTAAATATAC CAAGCAAACC GAGGAGGTTC TAGCTAAGTA TCGTCCTCGC
CTGGAAGGTA AGAAAGTGAT GATGATGGTG GGCGGTTTAC GTCCTCGTCA CGTTGTTCCT
GCTTTCACCG ATTTAGGGAT GGAAATGATT GGTACTGGGT ATGAGTTCGC TCACGGGGAT
GATTACAAGC GGACTACTGA GTATATTGGT GATGCGACTC TGGTCTATGA TGATGTCACT
GCTTATGAGT TTGAGAAGTT CGTCCAAGAA CTGAAGCCGG ATTTAGTGGC TTCTGGGGTT
AAGGAAAAAT ACGTCTTCCA AAAGATGGGT TTACCTTTCC GTCAGATGCA CTCTTGGGAT
TATTCTGGTC CTTATCATGG TTATGATGGT TTTGCTATCT TTGCTCGTGA TATGGATTTG
GCTCTCAATA ATCCGACTTG GGGCTTGATT AAGTCTCCTT GGAAGAAGTA A
 
Protein sequence
MSTVDDRKQL IQDVLDDYPE KLAKKRAKHL NVYEEGKADC GVKSNVKSAP GVMTARGCAY 
AGSKGVVWGP IKDMIHISHG PIGCGYYSWS GRRNYYIGTT GVDTFGTMNF TSDFQERDIV
FGGDKKLLKI MQEIETLFPL NNGVSVQSEC PIGLIGDDIE AVARKASKES GKPVVPVRCE
GFRGVSQSLG HHIANDAVRD WVFSRTDAPE IETTPYDVAI IGDYNIGGDA WSSRILLEEI
GLRVVAQWSG DGTINEMMQT PKVKLNLIHC YRSMNYISRH MEEKYGIPWF EYNFFGPTKI
AESLRAIAAL FDDTIKENAE RVIAKYTKQT EEVLAKYRPR LEGKKVMMMV GGLRPRHVVP
AFTDLGMEMI GTGYEFAHGD DYKRTTEYIG DATLVYDDVT AYEFEKFVQE LKPDLVASGV
KEKYVFQKMG LPFRQMHSWD YSGPYHGYDG FAIFARDMDL ALNNPTWGLI KSPWKK