Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1754 |
Symbol | |
ID | 6375441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1895417 |
End bp | 1897039 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642684247 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_001960153 |
Protein GI | 189500683 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCAA GAAAATACCC TGAACCTTCC GTAGTCAGGG AGGAGTTGAT AAAGAAATAT CCGCCCAAGG TTGCCAAAAA AAGAGGCAAA GCGATTGTCA TCAACGATCC TGAAACGATC CCGCCGGTAC AGGCCAATAT CAGGACCATA CCCGGGATTA TCTCACAGAG GGGATGTTCT TACGCAGGAT GTAAAGGGGT TGTTCTGGGC CCGACAAGAG ACATTGTCAA TCTGGTGCAC GGACCTATCG GATGCAGTTT TTACGCCTGG CTGACCCGAA GAAACCAGAC GCGTCCTGAC GGACCGGATG ATAAAAACTA TATGACCTAC TGCTTCTCAA CCGACATGCA GGAAGAACAT GTTGTTTTCG GTGGAGAGAA AAAGCTGAAA GAGGCAATCC AGGAAGCCTA CGACATATTC CGTCCAAAAG CTATCGGCAT TTTCTCAACC TGTCCGGTAG GCCTTATCGG AGATGACGTT CACGCAGTAG CCAGAGAGAT GAAAGAAAAA CTTGGCGACT GCAATATTTT CGGCTTCAGC TGTGAAGGCT ACAAAGGTGT CAGCCAGTCG GCCGGACACC ATATCGCCAA TAACCAGGTC TTCAAACATG TTGTCGGCCT TGATGACACC GACAAGGGCG GAAAGTTCAA GATCAACATG CTTGGTGAAT ACAATATCGG AGGTGACGCT TTCGAGATCG AGCGGCTGCT TGAAAAATGC GGCATAACAA TGGTCGCGAG CTTCAGCGGC AACTCGACGG TCAACCAGTT TGAAAACTCT CACACCGCCG ATCTGAACGT GATCATGTGC CACCGCTCGA TCAACTATAT GGCCGAGATG ATGGAAACGA AATATGGCAT TCCCTGGATG AAAGTCAACT TCATCGGCGC GGAATCATCC GCAAAATCAC TCCGCAGAAT CGCCAGGTAT TTTGAAGATG AAGAACTGAT GGCAAAGGTC GAGCAGGTCA TAGCCGAGGA ACTGCCGGTC GTCCAGTCGG TGATCAACGA GATCTACCCG AGAACAAAAG GTAAGCTCGC TATGCTCTTC GTCGGCGGGT CCAGGGCTCA CCACTATCAG GAGCTGTTTG GTGAACTGGG TATGGAAACC ATCTCGGCAG GTTACGAGTT CGGACATCGG GACGACTATG AAGGGCGGAA GGTCATTCCG AATATCAAAG TAGATGCTGA CAGCAAGAAC ATCGAGGAGC TCAAGGTTAC CGCTGATCCG GAAAAATTCA AACCGAGAAA AACGGAAGAA GAACTTGAAA AGCTCAAAGC TGAAGGCCTT GACATCAGGG AATACGAAGG CATGATGCCC GATATGAAAA AAGGATCTAT CGTCATCGAT GATATCAGCC ATTACGAAAG CGAAAAGCTG ATTGAACTCT ACAAACCTGA TATTTTCTGC GCCGGCATCA AGGAAAAATA TGTCGTGCAG AAAATGGGCG TCCCCTTGAA ACAGCTTCAC AGCTATGACT ACGGGGGGCC TTATGCCGGA TTTAAAGGCG CGATTAACTT TTACAGGGAT ATCGACAGAA TGGTCAACAG CCGGGTCTGG AAACTTATCC AGGCGCCTTG GGAAGAAACG ACTGAGCTGG AAGCTAACTA TGTCACACAG TAA
|
Protein sequence | MESRKYPEPS VVREELIKKY PPKVAKKRGK AIVINDPETI PPVQANIRTI PGIISQRGCS YAGCKGVVLG PTRDIVNLVH GPIGCSFYAW LTRRNQTRPD GPDDKNYMTY CFSTDMQEEH VVFGGEKKLK EAIQEAYDIF RPKAIGIFST CPVGLIGDDV HAVAREMKEK LGDCNIFGFS CEGYKGVSQS AGHHIANNQV FKHVVGLDDT DKGGKFKINM LGEYNIGGDA FEIERLLEKC GITMVASFSG NSTVNQFENS HTADLNVIMC HRSINYMAEM METKYGIPWM KVNFIGAESS AKSLRRIARY FEDEELMAKV EQVIAEELPV VQSVINEIYP RTKGKLAMLF VGGSRAHHYQ ELFGELGMET ISAGYEFGHR DDYEGRKVIP NIKVDADSKN IEELKVTADP EKFKPRKTEE ELEKLKAEGL DIREYEGMMP DMKKGSIVID DISHYESEKL IELYKPDIFC AGIKEKYVVQ KMGVPLKQLH SYDYGGPYAG FKGAINFYRD IDRMVNSRVW KLIQAPWEET TELEANYVTQ
|
| |