Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2934 |
Symbol | |
ID | 4597435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 3113138 |
End bp | 3115999 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639777539 |
Product | formate dehydrogenase |
Protein accession | YP_924123 |
Protein GI | 119717158 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGA ACGGAGCACC TGTGAACCCG CCCGTGAACC TGCCCGGCGC CCGCCGCCAC GAGGGACTGC GCAACTTCCC GCCGGTCGAG GACTGGGACC ACCACGTCGA GTACGACGCC AAGGCCCATC CGCGCAAGGT CCCGCACACC TACTCCCTGA TCCCGACGAT CTGCTTCAAC TGCGAGTCGT CGTGCGGGCT GCTGGCCTAC GTCGACCACG AGGACCTCTC GATCAAGAAG TTCGAGGGCA ACCCGGCCCA CCCGGGGTCC CGCGGCCGCA ACTGCGCCAA GGGCCCGGCC ACGATCAACC AGGTCCACGA CCCGGAGCGG ATCCTGCACC CGCTGCGCCG GGTCGGGGAG CGCGGCTCGG GGCAGTTCGA GCAGGTGAGC TGGGAGGAGG CGCTGACCGA CATCGCGACC CGGATCCGGG GCGCGATCGT CGAGGACCGG CGCGACGAGG TGATGTACCA CGTGGGTCGT CCGGGTGAGG ACGGCTTCGT CGAGCGGGTG CTCCAGGCCT GGGGCGTCGA CGGGCACAAC AGCCACACCA ACGTGTGCTC GTCGGGCGGG CGCAGCGGCT ACACCCACTG GATGGGCTAC GACCGGCCGT CGCCGGACTA CGCCAACGCC AAGACGATCC TGCTGCTGTC CTCGCACCTG GAGACCGGCC ACTACTTCAA CCCGCACGCC CAGCGGATCA TGGAGGCCAA GCAGAACGGC GCCAAGATCG CGGTGCTGGA CCTGCGGCTC TCGAACACCG CCTCGCACGC CGACCTGTGG ATCGCGCCGT GGCCGGGCAG CGAGGGAGCG ATCTTCCTGG CCGTGGCCTC CTACCTGCTG CGCACCGGGA GCATCGACCG TGAGTTCATC CGGCGCTGGG TGAACTGGGA CGTGTACCTC GAGCGGCTGC ATCCCGGGAC ACCGAAGGAC TTCGACGCGT TCCTGGCCCG GCTCACCGAG GACTACGCGC GCTACACCTT CGAGTACGCC GCGGAGGTGG CCCACGTCGA GGTCGAGCAG CTCGAGGCCC TGGCCCGGAT GGTGGCCGAT GGCGGCACCC GGCTCGCCAC CCACACGTGG CGCTCGGCCG CGGCCGGCAA CCTCGGCGGC TGGCAGATCA CCAGGTCGCT GTTCTTCCTC AACGTGCTGA CCGGCAGCGT CGGCACCGCG GGTGGCACCC ACCCCAACGG CTGGGACAAG ATCATCGCCC ACCATCCGCT GATGCCCGAG GCGAACGAGG CCTGGAGCGA GCTGCTGTGG CCGATCGAGT ACCCGTTGAC GACGAACGAG ATGTCGACGC TGCTGCCGCA CCTGCTCCAG GAGGGCCGCG GGAGGCTGGA GGTCTACTTC AGCCGGGTCT ACAACCCGCT GTGGGTCAAC CCCGACGGGT TCACCTGGAT CGACGTGCTC AAGGACGAAT CGAAGATCGG CCTGCACGTC GCGCTCACCC CGACCTGGAG CGAGTCCGCG GCCTTCGCCG ACTACGTGCT GCCCATGGGC CACGCGTCGG AACGCCACGA CACGATGTCC TTCGAGACGC ACACCTCCAA GTGGCTGGCC TTCCGCCAGC CGGTGACCCG GGTGGCGATG GAGCGGCGCG GCATCCCCTA CCGCGACAGC CGCGACACCA ACCCCGGTGA GGTCTGGGAG GAGAACGAGT TCTGGTTCGA GCTGTCCTGG CAGATCGACC CCGACGGCTC GCTGGGCATC CGCCGCTACT TCGAGTCCCC GGACCGCCCC GGGGAGCGGA TGACCCAGGA CGAGTACTAC GAGACGGTCT TCTCCACCAA CGTCCCAGGC CTGCCGGAGG CCGCGCGGGC ACAGGGCCTG TCACCGTTGC AGTACATGCG CAGGTACGGC GTGTTCGAGG TGGCGCGCGA CCTGTACCGG CTCGACGAGC GACCGCTCAG CGCGGCCGAG CTCGAAGGCG CGGTCCCCGA CGAGAACGGC GTCCTGCGCA AGCCGGTCAC CCTGGAGAGC CAGCCGCCGC TGGTCGGCGA GGCCGGCGCG GTCGGCCTCG TGCACGAGGA CGGCACCAGG ACGGCGGGCT GGCTGTCGCC GTCGCGCAAG CTCGAGCTGT ACTCCACGAC CCTCGCGGAC TGGGGCTGGC CGGAGCATGC CACCCCGGGC TACATCGAGT CCCACGTCGC CGCCACCCGG ATCGACCGTG CCGACCAGGA GTTCGTGCTG ATGCCGAACT TCCGGCTCCC CACCCTGGTG CACACCCGGT CCGGCAACGC CAAGTACCTC AACGAGATCG CCAACACGCA CCCGCTGTGG TTCAACACCG CGGACGCTGC CGCGCTGGAC GTCGGCACCG GCGACCTGGT CCGGGTCACC ACGGAGATCG GCCACTTCGT GGTCCGGGCC TGGGCGACCG AGGCGATCCG GCCGGGCGTC GTGGGCCTCT CCCACCACAT GGGCCGCTGG ATGCAGGACG GCCATCCCGG GAGCCGCTGG GTGATGGGCA AGGTGGACCT GCAGCGCAGC GACGAGGGGG TCTGGAAGCT GCGGTACGTC GACACCGTCA AGCCCTTCAC CAGCGAGGAT CCGGACTCCG AGCGGATCTA CTGGGACGAC CCCGGCGTGC ACCAGAACCT GGCGTTCCCG GTGCAGCCGG ACCCGGTCTC GGGCATGCAC TGCTGGCACC AGAAGGTGCG CATCGAGAAG GCGCACGAGG GTGACCGGTA CGGCGACGTG GAGGTCGACG TGAACAAGTC GCGCGAGGCC TACCGGCGCT GGCTGGCCAT CGCGCGACCG GGTCCCGGGC CGGACGGGCA GCGCCGCCCC GAGTTCTTGA TGCGGCACGT GACCCCCAAA CGCAAGGCCT ACCGGTTCGC GGCGGCCGGA GCGACGCGAT GA
|
Protein sequence | MTTNGAPVNP PVNLPGARRH EGLRNFPPVE DWDHHVEYDA KAHPRKVPHT YSLIPTICFN CESSCGLLAY VDHEDLSIKK FEGNPAHPGS RGRNCAKGPA TINQVHDPER ILHPLRRVGE RGSGQFEQVS WEEALTDIAT RIRGAIVEDR RDEVMYHVGR PGEDGFVERV LQAWGVDGHN SHTNVCSSGG RSGYTHWMGY DRPSPDYANA KTILLLSSHL ETGHYFNPHA QRIMEAKQNG AKIAVLDLRL SNTASHADLW IAPWPGSEGA IFLAVASYLL RTGSIDREFI RRWVNWDVYL ERLHPGTPKD FDAFLARLTE DYARYTFEYA AEVAHVEVEQ LEALARMVAD GGTRLATHTW RSAAAGNLGG WQITRSLFFL NVLTGSVGTA GGTHPNGWDK IIAHHPLMPE ANEAWSELLW PIEYPLTTNE MSTLLPHLLQ EGRGRLEVYF SRVYNPLWVN PDGFTWIDVL KDESKIGLHV ALTPTWSESA AFADYVLPMG HASERHDTMS FETHTSKWLA FRQPVTRVAM ERRGIPYRDS RDTNPGEVWE ENEFWFELSW QIDPDGSLGI RRYFESPDRP GERMTQDEYY ETVFSTNVPG LPEAARAQGL SPLQYMRRYG VFEVARDLYR LDERPLSAAE LEGAVPDENG VLRKPVTLES QPPLVGEAGA VGLVHEDGTR TAGWLSPSRK LELYSTTLAD WGWPEHATPG YIESHVAATR IDRADQEFVL MPNFRLPTLV HTRSGNAKYL NEIANTHPLW FNTADAAALD VGTGDLVRVT TEIGHFVVRA WATEAIRPGV VGLSHHMGRW MQDGHPGSRW VMGKVDLQRS DEGVWKLRYV DTVKPFTSED PDSERIYWDD PGVHQNLAFP VQPDPVSGMH CWHQKVRIEK AHEGDRYGDV EVDVNKSREA YRRWLAIARP GPGPDGQRRP EFLMRHVTPK RKAYRFAAAG ATR
|
| |