Gene Ppha_1952 details


Gene Information

Locus tag: Ppha_1952
Symbol:
ID: 6463080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 2038466
End bp: 2040100
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 50%
IMG OID: 642728155
Product: nitrogenase molybdenum-iron protein alpha chain
Protein accession: YP_002018785
Protein GI: 194336991
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
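
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the record states neither convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2038466, 2040100)
print(length_bp)                  # 1635
print(protein_length(length_bp))  # 544
```

Both values match the Gene Length and Protein Length fields of the record.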


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0543638
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGGCGA ACAGAGTTTA TCCGGATCCT TCCCAGGTCA GGGAGGAACT GATACAAAAA 
TATCCGGCCA AGGTTGCAAA AAAACGGGCC AAGTCGATCA TCATCAATGA CCCGGAGATC
ATTCCTGAGG TGCAGGCCAA CGTGCGTACC GTACCGGGTA TCATCACACA GCGCGGCTGT
TCTTATGCTG GATGTAAAGG TGTTGTGCTC GGCCCTACCC GTGATATTGT CAACATCGTA
CACGGACCAA TCGGTTGCAG CTTTTATGCC TGGTTGACCC GCCGGAACCA GACCAGACCC
GAGACCTTGC TGGATGAGAA CTATATCCCT TACTGTTTTT CAACGGACAT GCAGGAGGAG
AATATCGTCT TTGGTGGTGA AAAGAAGCTG AAAATTGCAA TCCAGGAGGC TTATGACCTC
TTTCATCCAA AGTCCATTGC CATCTTCTCG ACCTGTCCGG TTGGCCTGAT TGGTGATGAC
GTTCATGCCG CTTCACGTGA AATGAAGGAG AAACTGGGAG ACTGCAACGT TTTCGGTTTC
AGTTGCGAAG GGTACCGGGG TGTCAGCCAG TCGGCAGGCC ATCACATTGC CAACAACGGT
GTGTTCAAGC ACATGGTTGG CCGCAACAAC ACGCCGAGCG TGGGCAAGTT CAAGCTGAAC
CTGCTGGGTG AATACAACAT CGGCGGTGAC GCTTTTGAGA TTGAACGCAT TTTCAAGAAG
GTCGGCATTA CTCTTGTGGC CTCATTCAGT GGCAACTCGA CGGTCGGCCA GATTGAAAAC
GCTCACACTG CCGATCTGAA CGTGATCCTT TGTCACCGGT CGATCAACTA TATGGGTGAC
ATGATGGAGA CGAAGTACGG AATTCCGTGG ATGAAGATCA ACTTTGTCGG AGCAGAATCA
ACGGCAAAGT CGCTCCGCAA AATTGCTGAA TACTTTGGCG ACGAGGAGCT CAAGGCGAAG
GTTGAGGCTG TGATTGCCGA AGAGACACCA AAGGTGAAAG CGGTGATTGA GGAGATATTG
CCAAGGACAA AAGGCAAAAC TGCCATGCTC TTTGTCGGTG GATCACGTGC CCATCACTAC
CAGGATCTTT TTTCCGAGCT GGGCATGACG ACGGTAGCTG CAGGGTACGA GTTTGCTCAC
CGCGATGATT ACGAAGGGCG TGACGTACTG CCTAAAATCA AGATTGACGC CGACAGCAAG
AATATTGAGG AGCTGAAAGT GGTCGCAGAT CCCGACTTCT TCAACCCGAG AAAAACCGAA
GCGGAACTTG AAGCGCTGAA AGAAAAGGGG CTTGAAATCA ACGGTTATTC CGGAATGATG
AAGCAGATGA CCAGTAAATC GCTGGTTGTT GATGACCTCA GCCACTATGA GTCTGAAAAG
CTGATCGAGA TCTACAAGCC GGATATTTTC TGCGCCGGTA TCAAGGAGAA GTATGTGGTT
CAGAAGATGG GTATTCCGTT GAAACAGCTT CACAGCTACG ACTACGGTGG ACCTTACACT
GGCTTTGAAG GAGCGATAAA CTTCTACAGA GACATCGACC GTATGGTAAA CAATCCCGTT
TGGAAGCTGA TCAAGGCTCC ATGGGAAAAA GCCGGAAACG GTGCAGGACT TGCGGCCAGT
TACGTGACAC AGTAA
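
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before any computation. The sketch below, in Python, does that and computes GC content as (G + C) / total length. That is the usual definition, but it is an assumption that the GC content field of this record was computed the same way. The helper names are hypothetical, and only the first sequence line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste all lines for the whole-gene value.
first_line = "ATGGAGGCGA ACAGAGTTTA TCCGGATCCT TCCCAGGTCA GGGAGGAACT GATACAAAAA"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_fraction(first_line):.0%}")
```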
 
Protein sequence
MEANRVYPDP SQVREELIQK YPAKVAKKRA KSIIINDPEI IPEVQANVRT VPGIITQRGC 
SYAGCKGVVL GPTRDIVNIV HGPIGCSFYA WLTRRNQTRP ETLLDENYIP YCFSTDMQEE
NIVFGGEKKL KIAIQEAYDL FHPKSIAIFS TCPVGLIGDD VHAASREMKE KLGDCNVFGF
SCEGYRGVSQ SAGHHIANNG VFKHMVGRNN TPSVGKFKLN LLGEYNIGGD AFEIERIFKK
VGITLVASFS GNSTVGQIEN AHTADLNVIL CHRSINYMGD MMETKYGIPW MKINFVGAES
TAKSLRKIAE YFGDEELKAK VEAVIAEETP KVKAVIEEIL PRTKGKTAML FVGGSRAHHY
QDLFSELGMT TVAAGYEFAH RDDYEGRDVL PKIKIDADSK NIEELKVVAD PDFFNPRKTE
AELEALKEKG LEINGYSGMM KQMTSKSLVV DDLSHYESEK LIEIYKPDIF CAGIKEKYVV
QKMGIPLKQL HSYDYGGPYT GFEGAINFYR DIDRMVNNPV WKLIKAPWEK AGNGAGLAAS
YVTQ
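
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below, in Python, builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and the input is assumed to contain only A, C, G and T.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its amino acid ("*" marks a stop codon).
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence in this record.
print(translate("ATGGAGGCGA ACAGAGTTTA TCCGGATCCT"))  # MEANRVYPDP
print(translate("TACGTGACAC AGTAA"))                  # YVTQ
```

The two outputs match the first ten and last four residues of the protein sequence; passing the full gene sequence to `translate` extends the same check to every residue.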