Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0509 |
Symbol | |
ID | 4486506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 541202 |
End bp | 543088 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639729276 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_872268 |
Protein GI | 117927717 |
COG category | [C] Energy production and conversion |
COG ID | [COG5013] Nitrate reductase alpha subunit |
TIGRFAM ID | [TIGR01580] respiratory nitrate reductase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.337326 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGTA CGTCGTTCTT TTACCTCAAC ACGGATCAAT GGCGATACGA AGCATTCCGT GCCGATGAAT TCGCCAGCCC GCTGGGCGGT GACACGTTCC GCGGCGCATC CTTCGTCGAC TGCCTGGCCC GGGCAGCACG GCTGGGCTGG ACGCCGTCCT TCCCCACCTT CAACCGCAAC AGCCTCGACC TCGCAGACGA AGCGGCCCGG GCCGGCGTCC CGGTCTCCGA CTACGTCGTG CAGGAATTGT GTTCCGGCCG GCTCCGCTTC GCCTGCGAAG ATCCTGACGA CCCGGTGAAT TTCCCTCGCA TCCTCACGGT CTGGCGCGCA AACCTCCTTG GCTCCTCGGG AAAGGGCATG GAGTACTTCA TGCGCCATCT GCTCGGTACG GACGACGCCG TCCGGGCCGA GGAGACGGCG CCACCGTTGC GGCCGGCCGA CGTGACGTGG CGCGATCCCG CGCCCCGGGG GAAACTCGAC CTCCTCACCG CCATCGATTT CCGGATGACG AGCACCTGCA CCTACGCCGA CATCGTCCTC CCAGCCGCCA CCTGGTACGA AAAGCACGAC ATTTCCACCA CGGACATGCA TCCGTTCGTC CATTCGTTCA ATCCGGCGAT CCCACCACCG TGGGAGGCGA AGACCGACTT CGAGATATTT CACCGGCTCG CAGAGGTTTT CAGCCGGCTC GCCGGCGGAC GACTCGGCCG CCGGACGGAT GTCATCGCGG CGCCGCTCGG CCACGACACA CCCGACGAGT TGGCCACCCC GGGCGGCGTC GTCCGGGATT GGCGGCGCGG CGAGTGTGAG CCCGTCCCCG GCCGGACGAT GCCGAAAATT GTGACCGTCG AGCGTGACTA TGGCGCAATT GCCGAGAAGA TGCGGGCGCT CGGTCCGCTC GTCGACTCGC TCGGAACGTC AGCGAAGGGA ATCACCTGGA CGCCGCGGCA GGCGGTGGCT TACCTGCAGG CGGCGAACGG CGTGATCCGC GGCGGCGTCG CGGACGGCCG CCCGTCGTTG GCCCGCGACG TCCACCTCGC CGAAGCGATT CTCGCGCTGT CCGGAACGAC GAACGGTCAC ATCGCACTCC AGGCCTGGCA GGCACTCGAA GAGCGGACCG GTATGCCGTT ACGGGATCTG GCCGCGGAAC GGGCCGAGGA GCAGATCCGG TTCGCGGATA CGCAGGTGCA GCCGCGTGCG GTCATCACCT CGCCGGAGTG GTCCGGGGCG GAGACCGGCG GACGCCGCTA TTCACCGTTC GTCGTCAATG TCGAACGAAA GAAGCCGTGG CACACCCTGA CCGGCCGCAT GCACTTCTTT CTCGACCACG ACTGGATTCA GGCGTACGGC GAGGCGTTGC CGGCGTACCG GCCGCCGCTG GACTATCCCC GCTTCTTCGG CGACCAGCAG ATCGGGGACG GAACACCGGA AATCACGGTG CGCTACCTGA CACCGCACTC GAAGTGGTCG ATCCACTCGG AATACCAGGA CAATCTCCAC ATGCTCCGCC TTTTCCGCGG CGGCCCGGTG ATTTGGATGA GTCCCCGCGA CGCCGCCAAA ATCGGCGTCT CCGACAACGA CTGGATCGAG GCGTACAACC GCAACGGCGT CGTCGTCGCC CGGGCCGTGG TCACCCATCG GATGCCGGAG GGAACGGTCT TCATGTACCA CGCCAAGGAC CGGCACCTCA TGACCCCGAA ATCCGAGATC TCCGGTTGGC ATGGGGGCTC GGACAATTCC TTGACCCGCG TCGTCATCAA GCCCACGCAC CTCATCGGAG GCTACGCTCA GCTGAGCTAC GCCTTCAACT ACTACGGCCC AACCGGCAAT CAACGTGACG AAATCACGGT GATCCGGCGG CGGTCCCAGG AGGTGGCGTA CCAATGA
|
Protein sequence | MAGTSFFYLN TDQWRYEAFR ADEFASPLGG DTFRGASFVD CLARAARLGW TPSFPTFNRN SLDLADEAAR AGVPVSDYVV QELCSGRLRF ACEDPDDPVN FPRILTVWRA NLLGSSGKGM EYFMRHLLGT DDAVRAEETA PPLRPADVTW RDPAPRGKLD LLTAIDFRMT STCTYADIVL PAATWYEKHD ISTTDMHPFV HSFNPAIPPP WEAKTDFEIF HRLAEVFSRL AGGRLGRRTD VIAAPLGHDT PDELATPGGV VRDWRRGECE PVPGRTMPKI VTVERDYGAI AEKMRALGPL VDSLGTSAKG ITWTPRQAVA YLQAANGVIR GGVADGRPSL ARDVHLAEAI LALSGTTNGH IALQAWQALE ERTGMPLRDL AAERAEEQIR FADTQVQPRA VITSPEWSGA ETGGRRYSPF VVNVERKKPW HTLTGRMHFF LDHDWIQAYG EALPAYRPPL DYPRFFGDQQ IGDGTPEITV RYLTPHSKWS IHSEYQDNLH MLRLFRGGPV IWMSPRDAAK IGVSDNDWIE AYNRNGVVVA RAVVTHRMPE GTVFMYHAKD RHLMTPKSEI SGWHGGSDNS LTRVVIKPTH LIGGYAQLSY AFNYYGPTGN QRDEITVIRR RSQEVAYQ
|
| |