Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnuc_1109 |
Symbol | |
ID | 5053199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 |
Kingdom | Bacteria |
Replicon accession | NC_009379 |
Strand | - |
Start bp | 1151177 |
End bp | 1154068 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640471279 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001155889 |
Protein GI | 145589292 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.612386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAC CCACAAATCC GAAAGAACTG CAATTGCAAA CCGTTGAATT CAAATTAGAC GGCAAGACGA TTGTTTCTTA TGAAGGTGAA ACAATTCTCA AGGCAGCAAA GCGTCACGGT ATTGATATTC CGCATCTTTG CTTCAAAGAT GGTTATCGCC CAGACGGAAA TTGCCGTGCT TGTGTTGTCG AAATTAATGG TGAGCGCACC TTGGCGCCTA GCTGCTGCAG AACCGCCACT CCAGGCATGG AAGTGCAAGC CAATAGCGAG CGTGCTGTAA AGAGTCAAAA GCTTGTTTTG GAGATGTTGC TCTCTGATAT GCCAGACCAA GGTTTTAAAT GGGTGGGCGA TAGCAAGGCA GAAGAAGAGC AGCATCAGCA TGGTGAGCTC AGCACTTGGG CTGCACGTAT GGATGTCACT GTTCGTCCCG AGCTCAAAGC ACTGCGTAGA GATAAAGTCG CTCACGATAT CTCTCATCCT GCGATGGCTG TCAATCTTGA TGCCTGTATT CAATGTAACC GCTGTGTACG AGCTTGCCGT GAAGAGCAGG TGAATGATGT GATTGGTTAC GCGATGCGTG GTGGACATAG CGAAATTGTG TTTGATCTAA ACGATCCAAT GGGCGAGAGT ACCTGTGTTG CCTGCGGTGA ATGTGTGCAG GCTTGTCCTA CTGGCGCATT GATGCCAAAA GGATTGATTG GTTCACAAAT CGTCGATCGT AAAGTAGATT CTGTTTGCCC ATTCTGCGGA GTAGGTTGTC AAATTACCTA TAACGTGAAA GATGAAAAGA TTGTGAGTGT TGAGGGTCGT GATGGCCCAG CGAACCATAA TCGTTTGTGC GTTAAAGGTC GCTTTGGCAT GGATTACATC CATAACCCAC AGCGTTTAAC CAAGCCGCTG ATACGTAAAC CTGGCGTTCC TAAAGATGAA TCTTTGCTCG AGCGTGATCA AGATTGGTCT GAGATCTTCC GCGAAGCAAC ATGGGAAGAG GCGATTGAGT TTGCTGGCGG TGGGTTGAAG AAACTCAAAG ATCAGTACGG CAATAAAGTG TTAGCAGGAT TTGGTTCTGC AAAAGGAAGT AACGAAGAGG CGTATTTATT CCAAAAGCTA GTACGCACAG GCTTTGGCAG CAATAACGTA GACCATTGCA CTCGTCTTTG CCATGCTTCA TCTGTAGCAG CGCTTTTAGA GGGTGTGGGT TCTGGTGCTG TAAGTAATCA GGTAAACGAT GTTGAGCATT CCAGCTTAAT CATGTTGATT GGATCGAACC CAACAGCCAA TCACCCGGTT GCTGCAACTT GGTTTAAGAA TGCTGCTAAG CGTGGCGCAA AGATTGTGTT GTGCGATCCT CGTAAAACTG AAATCAGTAA ACATGCTTGG CGCACTATGC AGTTCAAGCC TGATACTGAT GTAGCGATGC TCAATGCCAT GATCTATACG ATCATTGAAG AAGGCTTGGT TGATCAAGAA TTTATCAAAA ATCGCTCTAA TAACTTTGAG GCACTCAAAG AAAATATCAA GGGGTATAGC CCTGAAGCGA TGGCGCCAAT CTGTGGTATC CCAGCTGAGA CACTGCGTGA GGTTGCTAGA GAATTTGCCA CTACTAAATC TGCGATGATT TTGTGGGGCA TGGGCGTGAG TCAACACGTG CATGGCACTG ATAACGCTCG CTGCTTAATT GCGCTCGTGA GTATTACTGG CCAAATTGGT AAACCGGGTT CTGGTCTACA TCCATTGCGT GGTCAAAACA ATGTGCAAGG TGCTAGTGAT GCTGGTTTGA TTCCGATGAT GTTCCCGAAC TATCAAAGGG TTGATAACCC ACAGGCGCAT GCTTGGTTTG AGAAATTCTG GGATACACCG CTTGATAAAA AGCCGGGTTA CACCGTAGTT GAAATCATGC ACAAAATCAC TGCACCTGAT AGCGATCCCG ATAAGATTCG CGGCATGTAT GTTGAAGGTG AGAACCCTGC GATGAGTGAT CCTGATTTGA ATCACGCACG ACACGCTTTG GCTTCTTTGG AGCATTTGGT GGTGCAAGAC ATCTTTATGA CTGAAACTGC TCTTCTCGCA GACGTTGTAT TGCCTGCAAG CGCTTGGCCA GAAAAGGTGG GTACTGCAAG TAATACTGAC CGTATGGTGC AAATGGGTAA AAAAGCCATT GAACCTCCTG GAGATGCAAA GCCAGATCTA TGGATCATTC AGGAGATTGC TAAACGCATG GGCTTGAACT GGAATTATCA AGGTTCTGAT GATGGAGTTG CTGAGGTCTA TGATGAAATG CGTCAAGCCA TGCATGCCGC TATTAATGGT ATTACTTGGG AGCGCTTGGA AAAAGAATCG AGTGTTACTT ATCCATGTTT ATCGCTTGAA GATCCGGGTC GCCCAATCGT TTTTGATGAT GAGTTTGCTA CTACCGATGG AAAAGTGAAG TTGGTTCCTG CGGATATCAT CCCCGCAAAT GAACGTCCTG ATTCTGAATT CCCGTTTGTC CTGATTACTG GTCGTCAACT TGAGCATTGG CATACTGGAA GCATGACTCG TCGCGCAACA GTGCTCGATG CTATTGAGCC TATGGCCACT GTATCGATGA ATGGTGAAGA TATGACTCAG CTAGGTGTGT CTGCTGGAGA TGTCATTACC GTTCAGTCTC GTCGCGGTGA GGTAGGTATT CATGTACGTC GAGATGACGG TACACCACGT GGTGTGATCT TTATTCCGTT TGCATACTAT GAGGCTGCTG CTAACCTCAT TACGAATTCT GCGCTAGATC CGGTTGGCAA GATTCCAGAA TTTAAGTATT GCGCGGTCAA GTTGGCTAAA GGTGGTCAAG CGTCCAAAGT AATGGGATAT GGCACTAACG ATCCAACACT CAATCCGGCG GCTATTGTCT AA
|
Protein sequence | MNAPTNPKEL QLQTVEFKLD GKTIVSYEGE TILKAAKRHG IDIPHLCFKD GYRPDGNCRA CVVEINGERT LAPSCCRTAT PGMEVQANSE RAVKSQKLVL EMLLSDMPDQ GFKWVGDSKA EEEQHQHGEL STWAARMDVT VRPELKALRR DKVAHDISHP AMAVNLDACI QCNRCVRACR EEQVNDVIGY AMRGGHSEIV FDLNDPMGES TCVACGECVQ ACPTGALMPK GLIGSQIVDR KVDSVCPFCG VGCQITYNVK DEKIVSVEGR DGPANHNRLC VKGRFGMDYI HNPQRLTKPL IRKPGVPKDE SLLERDQDWS EIFREATWEE AIEFAGGGLK KLKDQYGNKV LAGFGSAKGS NEEAYLFQKL VRTGFGSNNV DHCTRLCHAS SVAALLEGVG SGAVSNQVND VEHSSLIMLI GSNPTANHPV AATWFKNAAK RGAKIVLCDP RKTEISKHAW RTMQFKPDTD VAMLNAMIYT IIEEGLVDQE FIKNRSNNFE ALKENIKGYS PEAMAPICGI PAETLREVAR EFATTKSAMI LWGMGVSQHV HGTDNARCLI ALVSITGQIG KPGSGLHPLR GQNNVQGASD AGLIPMMFPN YQRVDNPQAH AWFEKFWDTP LDKKPGYTVV EIMHKITAPD SDPDKIRGMY VEGENPAMSD PDLNHARHAL ASLEHLVVQD IFMTETALLA DVVLPASAWP EKVGTASNTD RMVQMGKKAI EPPGDAKPDL WIIQEIAKRM GLNWNYQGSD DGVAEVYDEM RQAMHAAING ITWERLEKES SVTYPCLSLE DPGRPIVFDD EFATTDGKVK LVPADIIPAN ERPDSEFPFV LITGRQLEHW HTGSMTRRAT VLDAIEPMAT VSMNGEDMTQ LGVSAGDVIT VQSRRGEVGI HVRRDDGTPR GVIFIPFAYY EAAANLITNS ALDPVGKIPE FKYCAVKLAK GGQASKVMGY GTNDPTLNPA AIV
|
| |