Gene Information |
Locus tag | Daud_0146 |
Symbol | |
ID | 6026717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | + |
Start bp | 165256 |
End bp | 166704 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641593002 |
Product | nitrogenase component I, alpha chain |
Protein accession | YP_001716346 |
Protein GI | 169830364 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain; [TIGR01284] nitrogenase alpha chain; [TIGR01862] nitrogenase component I, alpha chain |
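The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts one terminal stop codon that is not translated. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table above.
start_bp = 165256
end_bp = 166704
gene_length_bp = 1449
protein_length_aa = 482


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of translated codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Compare the derived values with the values reported in the record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: the span from 165256 to 166704 covers 1449 bp, which is 483 codons, or 482 amino acids plus the stop codon.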
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCGTTTC TCAGGCTCAA GTGTGATGAA CTGATCCCCG AACGGGAAAA GCACACTTAC ATCACGGATA GGGAGAACCC CGTCATCCCG CTTTGCAACA TCAACACGAT CCCGGGGGAT ATGACCGAGC GCGGGTGAGC GTTCGCGGGT GCCCGCGGGG TCGTGGGCGG ACCAATCACC GATGCTATCC AGATCGTGCA CGCGCCGGTC GGATGTGCCT ATTACACCTG GGCGACGCGG CGTCACCTTT CCGACCAGTA CGCCTGGAGC ATGCCCGGCC GGCTGGACAA CGTGGCCTTC AACCGCCGCT TCTGTGTGTG CACCGACATG GAGGAAAAGG ATGTTGTCTT CGGCGGGACC AAGAAACTCC TAAAGAGCGC GCTGGAGGCG GTAAGACTGT TTCCGGAAGC GACGGGGATC ATCATGTACA CCACCTGCAC CACCGGTCTG ATCGGGGACG ATATCGGGTC GGTGGCCAAG CAAATCGAAC GGGAGACCGG AAAGCCGGTG TTCTTCGCGG AGGCGCCCGG CTGTTCCGGG GTGAGCCAGT CCAAGGGTCA TCACGTGGCG AACCGGCAGT TTTTCGAACA GATCAACGAG ATCCGGCGCC GGCGCCCGGA ACTGATCACA CCCGAGGCGG AGCGGACGCC TTACGACGTC GTCCTGGTCG GGGAGTACAA CATGGACTGG GACCTCAAGG TGATCCTGCC GCTGATGGAG AGCATCGGGA TGCGGGTGGT AAGCACCTTC ACCGGAAACG CCCGGATGAT GGACCTGGTC CGGCTGCCGG ATACCAAGCT CAACGTGGTG CACTGCCAGC GGTCGGCAAC CTACATCGCG GATATGATCA AGGAAGGCTA CGACATTCCC TACGTCAAGG CGTCGTTCTT CGGTATCCAG CAGACGAGCA AGGCCCTGCG GACCATTGCC CGCCATTTTG GTCTGGAGGA GCGGGCGGAG CAGGTCATCG CCGCGGAGAC GATCCGCATC CAGCCGGCCC TGGAGTGGTA CCGCGAGCGC CTGCAGGGCA AGACCGTGGC CGTCTACGTC GGGGGGCCCC GGGTCTGGCA CTGGATCAAG CTCTTTGAGG AACTGGGCAT GAAGGTGGTG GCCGGAGCCT GCACCTTCGC CCACGAGGAC GACTACGAGA AGATCAACGC CCGCGCCGGT GACGGGGTGC TGATCATCGA CAACCCGAAC GAGTTCGAGA TCGAGGAGTT GCTGGAAACC TGTAAACCGG ATATCTTCCT CTGCGGCCTG AAGGAGAAGT TCCTGGGCCG CAAGATGGGC GTGCCCACCC TGAACTCCCA TTCCTACGAG AAAGGCCCCT ACGCGGGGTA CGTGGGGTTC ATCAACTTCG CCCGCGACAT CTACCAGGCG CTTTACGCCC CGGTATGGCG CCTGACCAAC GGAAAGGAGG CCGTTACCAA TTATGGCCGG CAACTGTAA
Protein sequence | MPFLRLKCDE LIPEREKHTY ITDRENPVIP LCNINTIPGD MTERGUAFAG ARGVVGGPIT DAIQIVHAPV GCAYYTWATR RHLSDQYAWS MPGRLDNVAF NRRFCVCTDM EEKDVVFGGT KKLLKSALEA VRLFPEATGI IMYTTCTTGL IGDDIGSVAK QIERETGKPV FFAEAPGCSG VSQSKGHHVA NRQFFEQINE IRRRRPELIT PEAERTPYDV VLVGEYNMDW DLKVILPLME SIGMRVVSTF TGNARMMDLV RLPDTKLNVV HCQRSATYIA DMIKEGYDIP YVKASFFGIQ QTSKALRTIA RHFGLEERAE QVIAAETIRI QPALEWYRER LQGKTVAVYV GGPRVWHWIK LFEELGMKVV AGACTFAHED DYEKINARAG DGVLIIDNPN EFEIEELLET CKPDIFLCGL KEKFLGRKMG VPTLNSHSYE KGPYAGYVGF INFARDIYQA LYAPVWRLTN GKEAVTNYGR QL