Gene Information |
Locus tag | Cphamn1_1757 |
Symbol | |
ID | 6375444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1899986 |
End bp | 1901359 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642684250 |
Product | Nitrogenase |
Protein accession | YP_001960156 |
Protein GI | 189500686 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
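The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS includes its stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1899986
END_BP = 1901359


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end coordinate precedes start coordinate")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


gene_length = gene_length_bp(START_BP, END_BP)
protein_length = expected_protein_length_aa(gene_length)
print(gene_length, protein_length)  # prints: 1374 457
```

Both results agree with the Gene Length (1374 bp) and Protein Length (457 aa) fields in the record.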
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0799606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAA CAGCAAAAGC AGCGACTCAG AACGCATGTA AACTCTGTAA TCCACTGGGA GCCTGCCTCG CGTTCAGAGG CATAGAAAAG TGCGTTCCCT TTCTTCACGG ATCACAGGGA TGCGCAACCT ATATTCGACG ATACCTTATC AGCCATTTCA AGGAACCGGT TGATATCGCG TCATCAAACT TCAATGAAGA TACGGCTGTA TTCGGCGGCA GCCACAACCT GCAACTGGGG TTGAAAAATG TCACGCTCCA GTATAAACCT GAGGTCATCG GGCTGGCGAC GACCTGCCTG TCGGAAACAA TCGGGGACGA TGTCGACATG ATCCTCCGCG ACTATGACAA ACTTTTTGAA AACGGAGAAC CGTTACCCAA CGGAAAACCG CTCCCATTGA TGATCCATGC GTCAACCCCC AGTTATCAGG GCAGCCACAT CGACGGGTTT CATGCCGCGG TTAAAGCGAC GGTTGAAACA ATTGCTGAAA GCGGACAAAA AGAGAATCTT CTGAACCTCT ATCCCAACAT GGTTTCTCCC GCGGACCTCA GACACATGAA GGAGATCCTC AAAGACTTCA ACATTCCCTA CGTCCTGCTG CCTGACTATT CGGAGACTCT TGACGGAGGA CCGTGGGATG AATATCACAG AATTCCGAAA GGCGGCACAA CGGTCAGCGC GATCAGAAAA AGCGGCAAGG CCGCTGCAAG TCTGGAATTC TCATCGGTAC TGACCGCAGA CAAGTCAGCT GCCGTATATC TGGAAAAGAA GTTCGATGTA CCTGCATATT CCATGACGTT GCCGATCGGC ATCAAACAGA GCGACGCGTT TTTCGGACTG CTCGAAAAGC TCTCAGAGAC TCCTATGCCT GAAAAATATG AAGATGAGCG GAGAAGACTT GTCGATGCTT ATGCAGACGG GCACAAGTAC ATTTTCGAGA AAAAAGCGAT TGTGTACGGT GAAGAGGATC TGGTGATCGC CATGACTGCG TTTCTGACAG AGATCGGCAT CACTCCTGTA CTGTGCGCTT CCGGAGGAAA AAGCGGTCAC CTGAAAAAAC GGATTGAAGA GATCGTTCCC GACAGTGAAA ATACCGGCAT ACTCGTCCGT GATGGTGTTG ATTTTGTTGA TATCGAGGAT GAGGCGAAAG TCCTGAAGCC CGATCTTCTC ATCGGCAACA GTAAAGGCTA CACCATGTCA AGGAAAAACA ACACTCCCAT CATCAGGATA GGATTTCCTA TCCATGACCG GTTCGGAGGA CAGCGTCAAC TTCATCTCGG TTATCGCGGG ACACAGGAAC TGTTCGACAG AATCGTCAAT ACCATTCTTC AAGAGAGACA GAATTCATCA CCAATCGGAT ATACATACCA GTAA
|
Protein sequence | MKTTAKAATQ NACKLCNPLG ACLAFRGIEK CVPFLHGSQG CATYIRRYLI SHFKEPVDIA SSNFNEDTAV FGGSHNLQLG LKNVTLQYKP EVIGLATTCL SETIGDDVDM ILRDYDKLFE NGEPLPNGKP LPLMIHASTP SYQGSHIDGF HAAVKATVET IAESGQKENL LNLYPNMVSP ADLRHMKEIL KDFNIPYVLL PDYSETLDGG PWDEYHRIPK GGTTVSAIRK SGKAAASLEF SSVLTADKSA AVYLEKKFDV PAYSMTLPIG IKQSDAFFGL LEKLSETPMP EKYEDERRRL VDAYADGHKY IFEKKAIVYG EEDLVIAMTA FLTEIGITPV LCASGGKSGH LKKRIEEIVP DSENTGILVR DGVDFVDIED EAKVLKPDLL IGNSKGYTMS RKNNTPIIRI GFPIHDRFGG QRQLHLGYRG TQELFDRIVN TILQERQNSS PIGYTYQ
|
| |
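The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC percentage, and translate DNA using the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
# Codon table in NCBI's compact layout: bases ordered T, C, A, G.
# Amino acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(text: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
sample = clean_sequence("ATGAAAACAA CAGCAAAAGC AGCGACTCAG")
print(translate(sample))  # prints: MKTTAKAATQ
```

The translated sample matches the first ten residues of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared with the GC content field.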