Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1247 |
Symbol | |
ID | 3748285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1708852 |
End bp | 1710486 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637773785 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_379551 |
Protein GI | 78189213 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0023858 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGAAA AACTTATGAC ATCCGACCCA GCGCAAGTGC GGGAGACGTT GATACAAAAA TATCCACCAA AGGTGGCTAA AAAGCGAGCA AAGTCCATTG TTATTAATGA CCCTGAAATA GTACCCGAAG TACAAGCTAA CGTAAGAACG GTACCGGGCA TTATTACACA ACGTGGTTGT GCGTATGCTG GTTGTAAAGG TGTGGTGCTT GGTCCAACAC GCGACATTGT CAATATAGTA CACGGTCCAA TTGGATGCAG CTTTTATGCG TGGTTAACCC GCCGTAACCA AACGCGCCCC GAAAGTCCAG AACATGCCAA CTACATCACC TACTGTTTTT CAACCGATAT GCAGGAAGAA AACGTGGTGT TTGGTGGTGA GAAAAAACTC AAAGTGGCAA TTCAAGAGGC TTATGACCTC TTCCACCCAA AATCAATTGC TATTTTTTCA ACCTGCCCTG TAGGTTTAAT TGGTGATGAC GTTCATGCGG CAGCAAGAGA GATGAAGGAG AAGTTTGGCG ACTGCAACGT CTTTGGTTTT AGTTGCGAAG GGTATCGCGG CGTTAGTCAA TCAGCAGGAC ACCACGTTGC CAACAACGGT GTTTTTAAGC ATATGGTTGG TCGCGACAAC ACGGTAAAGC CCGGCAAATT CAAGCTTAAC CTGCTTGGTG AGTACAACAT TGGCGGCGAC GCTTTTGAGC TTGAGCGCAT TTTCGAGAGA GTTGGTATTA CGTTAGTTGC CTCGTTTAGT GGCAACTCAA CGGTTGGTGC GTTGGAAAAC TCACACACCG CCGACCTCAA CATTATTATG TGCCATCGCT CCATTAACTA CATGGGCGAT ATGATGGAGA CGAAATACGG TATTCCGTGG ATGAAGGTGA ACTTTGTAGG TGCCCAATCA ACCGCCAAAT CGTTACGCAA AATTGGTGAA TACTTTGGTG ATGAAGAGCT GAAAGCGCGT ATTGAAGCGG TGATTGCCGA AGAGATGCCA AAAGTGGAAG CCGTTATTAA TGAAATTCGT CCACGCACCG AAGGCAAAAC TGCCATGCTC TTTGTAGGTG GCTCACGAGC GCACCACTAC CAAGATCTCT TTACCGAGCT TGGCATGACA ACAATTGCAG CAGGTTACGA ATTTGCTCAC CGCGACGACT ACGAAGGGCG CGAAGTGCTA CCAAAAATCA AAATTGATGC CGACAGCAAA AACATTGAAG AGCTGAAGGT TGAAGCCGAT CCCGAGCTTT ACAAACAAAG AAAAAGTGAA GCTGAGCTTG AAGAGCTAAA GGCAAAAGGA TTAGAGATTA ATGGCTACGA AGGCATGATG AAGCAGATGA CGAAAAAGTC GCTTGTGGTG GACGATGTAA GCCACTATGA ATCTGAAATG CTGATTGAAA TGTACAAGCC CGACATTTTC TGTGCTGGTA TTAAAGAGAA ATATGTGGTG CAAAAAATGG GCGTGCCGCT CAAACAGCTT CATAGCTACG ACTACGGCGG ACCTTACACA GGTTTTGAAG GCGCACTTAA CTTCTACCGC GATATTGACC GAATGGTAAA CAATCCTGTT TGGAAGCTTA TTAAAGCTCC ATGGGAAAAA GCTGAAAATG GCGGAGTACT TGAAGCCGCT TACGTTCAAG GATAA
|
Protein sequence | MEEKLMTSDP AQVRETLIQK YPPKVAKKRA KSIVINDPEI VPEVQANVRT VPGIITQRGC AYAGCKGVVL GPTRDIVNIV HGPIGCSFYA WLTRRNQTRP ESPEHANYIT YCFSTDMQEE NVVFGGEKKL KVAIQEAYDL FHPKSIAIFS TCPVGLIGDD VHAAAREMKE KFGDCNVFGF SCEGYRGVSQ SAGHHVANNG VFKHMVGRDN TVKPGKFKLN LLGEYNIGGD AFELERIFER VGITLVASFS GNSTVGALEN SHTADLNIIM CHRSINYMGD MMETKYGIPW MKVNFVGAQS TAKSLRKIGE YFGDEELKAR IEAVIAEEMP KVEAVINEIR PRTEGKTAML FVGGSRAHHY QDLFTELGMT TIAAGYEFAH RDDYEGREVL PKIKIDADSK NIEELKVEAD PELYKQRKSE AELEELKAKG LEINGYEGMM KQMTKKSLVV DDVSHYESEM LIEMYKPDIF CAGIKEKYVV QKMGVPLKQL HSYDYGGPYT GFEGALNFYR DIDRMVNNPV WKLIKAPWEK AENGGVLEAA YVQG
|
| |