Gene Information |
Locus tag | Cag_1251 |
Symbol | |
ID | 3748289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1714963 |
End bp | 1716234 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637773789 |
Product | nitrogenase cofactor biosynthesis protein NifB |
Protein accession | YP_379555 |
Protein GI | 78189217 |
COG category | [R] General function prediction only |
COG ID | [COG0535] Predicted Fe-S oxidoreductases |
TIGRFAM ID | [TIGR01290] nitrogenase cofactor biosynthesis protein NifB |
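
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1714963
end_bp = 1716234

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1272 bp) and Protein Length (423 aa) fields.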
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000233074 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAAG ATATTACCAA ACACCCCTGT TTTAACGATT CAGCTCGCCA CACCTTTGGG CGAATTCACC TCCCTGTTGC GCCTAAGTGC AACATTCAGT GCAACTATTG CAGTCGCAAA TTCGACTGCA TGAACGAGAA CCGTCCGGGA GTTACCAGCA AAGTGCTCTC GCCCCAGCAA GCACTCTACT ACTTAGATCA AGCAATGGAG CTTTCACCAA ACATTGCTGT TGTAGGCATT GCAGGACCAG GCGATCCCTT TGCTAACCCC GATGAAACAA TGGAAACCTT GCGCCTTGTT CGGGCAAAAT ATCCCGAAAT GCTGCTATGC GTTGCCACCA ACGGGCTTGA TTTGCTACCA TATATTGACG AGCTGGCTCG CTTGCAAGTA AGCCATGTAA CCATCACCAT TAACGCCATT GATCCTGAAA TTGGTCAAGA AATTTATGCG TGGGTGCGTT ACAACAAAAA AATGTACAGA GGGAAGGATG CCGCAAAAGT GCTCATCAAC AACCAGCTTG AAGCTCTCAA ACGGTTAAAA GAGGTGGGCG TTACAGCAAA AGTAAACTCC ATTATTATTC CCGGCATTAA TGATGCGCAC GTTATTACCG TAGCCTCCAA AGTAGCCGAA TTGGGAGCCG ACATCTTGAA CTGCTTACCC TACTACAACA CTAAAGAGAC CGTTTTTGAA AATATAGATG AACCATCGCC TGAGCTTGTT TTTGAAATTC AAAAAGCCAC CAGCGAATTT TTACCCCAAA TGAAACACTG CGCCCGCTGC CGCGCCGATG CAGTTGGCAT TATTGGTGAA ATCAACTCAC CCGAAATTAT GGAGAAAATG GCAGAAGTAG CCGCAATGGC GAAAAACCCA TTTGAGCAGC GCCCCTACAT AGCCGTTGCA AGCATGGAAG GCGTATTGGT AAACCAACAT CTTGGCGAAG CCGACCGCCT TTTAGTGTAT GGCATAGACG AACAAGGCGA CTGCGTTTTA GTGGATTCAC GCCAAACACC ACCAGCAGGC GGCGGCAACG AACGGTGGGA AGCCCTTGCC AACTTACTCA GCGACTGCCG CACCGTATTA GTAAGCGGAA TCGGCAACTC GCCCAAAAAA GTACTCAACA ACAACGGCGT TGAAGTGCTT GTTATGGAAG GCGTTATAGC CGAAGCCGTT TACGCACTCT TTAACGGACA CGACATGCGC CACCTTATAA AAACCGAACT TGCCCACGCA TGTGGCACAA ACTGTTCAGG CACAGGCGCC GGCTGCGGTT AA
|
Protein sequence | MKQDITKHPC FNDSARHTFG RIHLPVAPKC NIQCNYCSRK FDCMNENRPG VTSKVLSPQQ ALYYLDQAME LSPNIAVVGI AGPGDPFANP DETMETLRLV RAKYPEMLLC VATNGLDLLP YIDELARLQV SHVTITINAI DPEIGQEIYA WVRYNKKMYR GKDAAKVLIN NQLEALKRLK EVGVTAKVNS IIIPGINDAH VITVASKVAE LGADILNCLP YYNTKETVFE NIDEPSPELV FEIQKATSEF LPQMKHCARC RADAVGIIGE INSPEIMEKM AEVAAMAKNP FEQRPYIAVA SMEGVLVNQH LGEADRLLVY GIDEQGDCVL VDSRQTPPAG GGNERWEALA NLLSDCRTVL VSGIGNSPKK VLNNNGVEVL VMEGVIAEAV YALFNGHDMR HLIKTELAHA CGTNCSGTGA GCG
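
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the tables differ in which codons may act as start codons). The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical and not part of any database API, and the example input is only the first three blocks of the gene sequence.

```python
# Codon table built from the NCBI layout (bases ordered T, C, A, G).
# Assumption: table 11 uses the same amino acid assignments as the
# standard code; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a non-empty DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence above.
fragment = clean_sequence("ATGAAACAAG ATATTACCAA ACACCCCTGT")
print(translate(fragment))
print(f"GC fraction of fragment: {gc_content(fragment):.2f}")
```

The translated fragment corresponds to the first ten residues of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_content` gives a value that can be compared with the GC content field of the record.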