Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0738 |
Symbol | |
ID | 4569932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 838633 |
End bp | 839985 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639765335 |
Product | nitrogenase |
Protein accession | YP_911216 |
Protein GI | 119356572 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACATG CAAAAACAGC AACACAAAAC GCCTGCAAAC TGTGCAACCC GCTCGGGGCA TGCCTTGCTT TCAGGGGAAT CGAAAACTGC GTACCCTTCC TGCACGGTTC ACAAGGTTGC GCAACCTATA TCCGTCGTTA CCTGATAAGC CATTACAAGG AACCGATCGA TATCGCTTCG TCGAACTTCA ATGAAGAAAC CGCCGTGTTT GGAGGCAGTC ATAACCTGCA GCTTGGACTG AAAAACGTCA CCGAGCAGTA CAAGCCTGAG GTTATCGGAC TGGCCACAAC ATGCCTGAGC GAAACCATCG GGGATGATGT GCCGATGATC CTTCGCGACT ATAAAAAAGC GTTTAAAAAC GGTACGCCAA TGCCGATAAT GATTCATGCC TCAACGCCAA GCTATCAGGG AAGCCACATC GACGGCTTTC ATGCCGCTGT CAGGGCAAGC GTTAAAACCC TTGCTGAAAA AGGGGCACGA AAAAACCTGA TCAATATCTT CCCGAACATG ATCTCGCCGG CGGATATTCG TTACATCAAG GAGATTCTCT CCGATTTCAG CGTGCCCTAT ATGCTCCTGC CAGACTACTC ACAGACGATG GACGGCGGGC CATGGGGCGA ATACCACCGC ATCCCGCCAG GAGGGACACC TGCCGGAGCC ATTGCCGGTG CAGGATGCGC AACGGCAAGT ATCGAGTTCG GCTCTACCCT TGAATCCTCA AAATCTGCCG CCGGCTACCT TGAGGAGACA TTCGACGTTC CCCGTTACCC TCTTGCCCTG CCGATCGGAA TAAACGAGAG CGACAGACTG TTCAACCTGC TTGAAAAACT GACCGAACAG AAAATGCCGG AAAAATATGA GGATGAACGA CGCCGTCTGG TTGACGCTTA TGCCGATGGG CACAAATATG TTTTTGAGAA AAAGGTCATT CTGTACGGAG AGGAAGACCT GGTGATCTCC ATGGCAGCGT TTCTGCGTGA AATCGGCATG ACCCCCGTAC TCTGCGCGTC GGGAGGCAAA AGCGGTCTGA TGAGAAAAAA GCTTCTTGAA CTGATTCCCG ACCTTGAGGA ACAGGGCATC AGGATACGTG ACGGAGTGGA CTTTGTCGAT ATTGAAGACG AAGCCAAAGT ACTGCTCCCT GACCTCCTGA TCGGCAATAG TAAAGGCTTT ACGATGGCAC GAAAAAACAA CATTCCGCTG ATGAGAATCG GCTTTCCGAT TCATGACCGG TTCGGTGGAC AGCGAATGCA TCATATCGGG TATCGCGGTA CCCAGGAACT CTTTGACCGG ATAGTCAACA CCGTTATCGA ACAACGGCAG AATGCTTCAT CAATAGGCTA TACCTACATG TAA
|
Protein sequence | MKHAKTATQN ACKLCNPLGA CLAFRGIENC VPFLHGSQGC ATYIRRYLIS HYKEPIDIAS SNFNEETAVF GGSHNLQLGL KNVTEQYKPE VIGLATTCLS ETIGDDVPMI LRDYKKAFKN GTPMPIMIHA STPSYQGSHI DGFHAAVRAS VKTLAEKGAR KNLINIFPNM ISPADIRYIK EILSDFSVPY MLLPDYSQTM DGGPWGEYHR IPPGGTPAGA IAGAGCATAS IEFGSTLESS KSAAGYLEET FDVPRYPLAL PIGINESDRL FNLLEKLTEQ KMPEKYEDER RRLVDAYADG HKYVFEKKVI LYGEEDLVIS MAAFLREIGM TPVLCASGGK SGLMRKKLLE LIPDLEEQGI RIRDGVDFVD IEDEAKVLLP DLLIGNSKGF TMARKNNIPL MRIGFPIHDR FGGQRMHHIG YRGTQELFDR IVNTVIEQRQ NASSIGYTYM
|
| |