Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0741 |
Symbol | |
ID | 4569953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 842766 |
End bp | 844406 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639765337 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_911218 |
Protein GI | 119356574 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTCGT ACAAAAATTT ACCGGATCCA TCCCTGGTCA GGGAGGATCT GATACAGAAA TACCCGACCA AAGTTGCAAA AAAACGCAAC AAGTCAATTG TAATCAATGA CCCGGAGACC ATTCCGGAGG TGCAGGCAAA CGTACGCACC GTTCCGGGAA TCATTACCCA GCGCGGCTGC TGCTATGCCG GTTGTAAAGG TGTTGTTCTT GGACCGACGC GTGATATCGT CAACATTGTT CACGGACCTA TCGGGTGCAG CTTCTATGCC TGGTTGACCC GTCGGAACCA GACGAGACCG GAAACACCGG AAGCCGAGAA CTATATCACC TACTGTTTCT CAACCGACAT GCAGGAGGAG AACGTTGTGT TCGGCGGCGA AAAGAAACTG AAGCAGGCCA TACAGGAAGC CTATGATCTC TTTCATCCGA AGGCTATCGC TATCTTTTCG ACCTGTCCGG TTGGTCTTAT TGGCGACGAT GTGCATGCCG CATCAAAAGA GATGCGTGAT AAATTCGGCG ACTGTAACGT TTTCGGGTTC AGTTGTGAAG GGTACCGGGG TGTCAGCCAG TCGGCAGGCC ACCATATTGC CAACAACGGC GTTTTCAAAC ACATGGTAGG ACGCAACAAC GCAGTCAAAG AGGGAAAGTT CAAATTAAAC CTGCTTGGTG AATACAATAT TGGCGGTGAT GCGTTTGAAA TCGAGCGCAT ATTCGAAAGA ACTGGCATCA CGCTTGTGGC ATCATTCAGC GGCAACTCGA CTGTCGGTCA GATTGAAAAT GCTCATACAG CCGATCTTAA CGTGATTCTC TGTCACCGGT CGATCAACTA CATGGGCGAG ATGATGGAAA CCAAATACGG TATCCCGTGG ATGAAAGTGA ACTTTGTCGG CGCTGAATCC ACAGCCAAAT CACTCAGAAA AATTGCTGAA TATTTTGGCG ATGAAGAGCT GAAAGCCCGG GTTGAAGCGG TAATTGCCGA AGAGATGCCA AAGGTGAAAG CGGTAATTGA TGAAATCAGA CCAAGAACCG AAGGCAAGAC CGCCATGCTT TTTGTTGGCG GGTCAAGGGC TCACCACTAT CAGGATCTTT TCAGCGAGCT TGGAATGACA ACGGTAGCAG CAGGGTATGA ATTCGCACAC CGCGACGACT ATGAAGGGCG CCATGTTCTG CCCGGCATAA AAATCGATGC CGACAGCAAG AACATCGAGG AGCTTAAAGT CACTGCGGAT CCGGAACTCT ACAATCCGAG AAAAAGTGAA GCCGAGCTTG AGGCGCTGAA AGAAAAAGGA CTCGAGATCA ACGGTTACGA AGGAATGATG AAGCAGATGC TGAAAAAAAC GCTCGTTGTT GACGACGTCA GCCACTATGA ATCGGAAAGA CTGATCGAGA TCTACAAGCC GGATATCTTC TGTGCAGGCA TCAAGGAGAA ATATGTCGTG CAGAAAATGG GCGTTCCGCT CAAGCAGCTT CACAGCTATG ACTACGGCGG TCCTTACACC GGTTTTGAAG GCGCACAGAA CTTCTACCGG GATATCGACC GGATGGTGAA CAATCCCGTC TGGAAGCTCA TCAAGGCCCC GTGGCAGAAA GCGGAAAACG GATCATCGAC AGCATTGGAA GCGAGTTACG TCACTCACTA A
|
Protein sequence | MQSYKNLPDP SLVREDLIQK YPTKVAKKRN KSIVINDPET IPEVQANVRT VPGIITQRGC CYAGCKGVVL GPTRDIVNIV HGPIGCSFYA WLTRRNQTRP ETPEAENYIT YCFSTDMQEE NVVFGGEKKL KQAIQEAYDL FHPKAIAIFS TCPVGLIGDD VHAASKEMRD KFGDCNVFGF SCEGYRGVSQ SAGHHIANNG VFKHMVGRNN AVKEGKFKLN LLGEYNIGGD AFEIERIFER TGITLVASFS GNSTVGQIEN AHTADLNVIL CHRSINYMGE MMETKYGIPW MKVNFVGAES TAKSLRKIAE YFGDEELKAR VEAVIAEEMP KVKAVIDEIR PRTEGKTAML FVGGSRAHHY QDLFSELGMT TVAAGYEFAH RDDYEGRHVL PGIKIDADSK NIEELKVTAD PELYNPRKSE AELEALKEKG LEINGYEGMM KQMLKKTLVV DDVSHYESER LIEIYKPDIF CAGIKEKYVV QKMGVPLKQL HSYDYGGPYT GFEGAQNFYR DIDRMVNNPV WKLIKAPWQK AENGSSTALE ASYVTH
|
| |