Gene Information |
Locus tag | Ppha_1952 |
Symbol | |
ID | 6463080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | + |
Start bp | 2038466 |
End bp | 2040100 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642728155 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_002018785 |
Protein GI | 194336991 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain; [TIGR01862] nitrogenase component I, alpha chain |
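The length fields in this record can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2038466, 2040100)
print(length)                  # 1635
print(protein_length(length))  # 544
```

Under these assumptions the computed values agree with the Gene Length (1635 bp) and Protein Length (544 aa) fields above.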
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0543638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGCGA ACAGAGTTTA TCCGGATCCT TCCCAGGTCA GGGAGGAACT GATACAAAAA TATCCGGCCA AGGTTGCAAA AAAACGGGCC AAGTCGATCA TCATCAATGA CCCGGAGATC ATTCCTGAGG TGCAGGCCAA CGTGCGTACC GTACCGGGTA TCATCACACA GCGCGGCTGT TCTTATGCTG GATGTAAAGG TGTTGTGCTC GGCCCTACCC GTGATATTGT CAACATCGTA CACGGACCAA TCGGTTGCAG CTTTTATGCC TGGTTGACCC GCCGGAACCA GACCAGACCC GAGACCTTGC TGGATGAGAA CTATATCCCT TACTGTTTTT CAACGGACAT GCAGGAGGAG AATATCGTCT TTGGTGGTGA AAAGAAGCTG AAAATTGCAA TCCAGGAGGC TTATGACCTC TTTCATCCAA AGTCCATTGC CATCTTCTCG ACCTGTCCGG TTGGCCTGAT TGGTGATGAC GTTCATGCCG CTTCACGTGA AATGAAGGAG AAACTGGGAG ACTGCAACGT TTTCGGTTTC AGTTGCGAAG GGTACCGGGG TGTCAGCCAG TCGGCAGGCC ATCACATTGC CAACAACGGT GTGTTCAAGC ACATGGTTGG CCGCAACAAC ACGCCGAGCG TGGGCAAGTT CAAGCTGAAC CTGCTGGGTG AATACAACAT CGGCGGTGAC GCTTTTGAGA TTGAACGCAT TTTCAAGAAG GTCGGCATTA CTCTTGTGGC CTCATTCAGT GGCAACTCGA CGGTCGGCCA GATTGAAAAC GCTCACACTG CCGATCTGAA CGTGATCCTT TGTCACCGGT CGATCAACTA TATGGGTGAC ATGATGGAGA CGAAGTACGG AATTCCGTGG ATGAAGATCA ACTTTGTCGG AGCAGAATCA ACGGCAAAGT CGCTCCGCAA AATTGCTGAA TACTTTGGCG ACGAGGAGCT CAAGGCGAAG GTTGAGGCTG TGATTGCCGA AGAGACACCA AAGGTGAAAG CGGTGATTGA GGAGATATTG CCAAGGACAA AAGGCAAAAC TGCCATGCTC TTTGTCGGTG GATCACGTGC CCATCACTAC CAGGATCTTT TTTCCGAGCT GGGCATGACG ACGGTAGCTG CAGGGTACGA GTTTGCTCAC CGCGATGATT ACGAAGGGCG TGACGTACTG CCTAAAATCA AGATTGACGC CGACAGCAAG AATATTGAGG AGCTGAAAGT GGTCGCAGAT CCCGACTTCT TCAACCCGAG AAAAACCGAA GCGGAACTTG AAGCGCTGAA AGAAAAGGGG CTTGAAATCA ACGGTTATTC CGGAATGATG AAGCAGATGA CCAGTAAATC GCTGGTTGTT GATGACCTCA GCCACTATGA GTCTGAAAAG CTGATCGAGA TCTACAAGCC GGATATTTTC TGCGCCGGTA TCAAGGAGAA GTATGTGGTT CAGAAGATGG GTATTCCGTT GAAACAGCTT CACAGCTACG ACTACGGTGG ACCTTACACT GGCTTTGAAG GAGCGATAAA CTTCTACAGA GACATCGACC GTATGGTAAA CAATCCCGTT TGGAAGCTGA TCAAGGCTCC ATGGGAAAAA GCCGGAAACG GTGCAGGACT TGCGGCCAGT TACGTGACAC AGTAA
|
Protein sequence | MEANRVYPDP SQVREELIQK YPAKVAKKRA KSIIINDPEI IPEVQANVRT VPGIITQRGC SYAGCKGVVL GPTRDIVNIV HGPIGCSFYA WLTRRNQTRP ETLLDENYIP YCFSTDMQEE NIVFGGEKKL KIAIQEAYDL FHPKSIAIFS TCPVGLIGDD VHAASREMKE KLGDCNVFGF SCEGYRGVSQ SAGHHIANNG VFKHMVGRNN TPSVGKFKLN LLGEYNIGGD AFEIERIFKK VGITLVASFS GNSTVGQIEN AHTADLNVIL CHRSINYMGD MMETKYGIPW MKINFVGAES TAKSLRKIAE YFGDEELKAK VEAVIAEETP KVKAVIEEIL PRTKGKTAML FVGGSRAHHY QDLFSELGMT TVAAGYEFAH RDDYEGRDVL PKIKIDADSK NIEELKVVAD PDFFNPRKTE AELEALKEKG LEINGYSGMM KQMTSKSLVV DDLSHYESEK LIEIYKPDIF CAGIKEKYVV QKMGIPLKQL HSYDYGGPYT GFEGAINFYR DIDRMVNNPV WKLIKAPWEK AGNGAGLAAS YVTQ
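The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the codon assignments of translation table 11, which match the standard genetic code (the two tables differ only in which codons may act as start codons), and it strips the spaces that group the sequence into blocks of ten. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and written for this example only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in tens) and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGGAGGCGA ACAGAGTTTA TCCGGATCCT"))  # MEANRVYPDP
```

Passing the full gene sequence to `translate` should reproduce the protein sequence once its spaces are removed, and `gc_fraction` on the same input can be compared with the GC content field.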