Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ajs_3470 |
Symbol | |
ID | 4674265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax sp. JS42 |
Kingdom | Bacteria |
Replicon accession | NC_008782 |
Strand | - |
Start bp | 3664910 |
End bp | 3667801 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639840503 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_987660 |
Protein GI | 121595764 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.873157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.153183 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCG CACCGCACAC CGGCGCAGGC GCCGTGCCGT TCGAGCTCGA CGGCGAACTC TTGCAGGCCC TTCCGGGCGA AACCCTGTGG CAGGCCGCGC GCCGCTACGG CGTGCAGATC CCCCACCTGT GCCACACCGA TGGCCTGCGG CCCGACGGTA ACTGCCGCGC CTGCGTGGTG GAGATCGAAG GCGAGCGCAC GCTGGCGCCC AGCTGCTGCC GCGCGCCCAC GCCCGGCATG CGCGTGCACG CCCAGAGCCC CCGCGCGCTC AAGAGCCAGC AGATGGTGCT GGAACTGCTG CTGTCCGACA TGCCAGAGAC CGGCTACAAG TGGAACGACG CTACAAAAAT AATAGCTGAT AGCGCTTGCC AGGAGAGCGT TGGAGCACAA TTTGACCAAG TACCAACGCC CCCGGGGCAG CACGGCGAGT TGAGCGCCTG GGCCGATGCG CTGGGCGTGG CCGTGCGTCC CGCGCTGGCC GCGCTGCGCC GCGCGCAGCC CGCGCCCGAC CTGAGCCACC CGGCCATGGC CGTGCACCTG GACGCGTGCA TCCAGTGCAA CCGCTGCGTG CGCGCCTGCC GCGAGGAACA GGTCAACGAC GTGATCGGCT ATGCACTGCG CGGAGCCGAC AGCAAGATCG TCTTCGACCT GGACGACCCC ATGGGCGCCA GCACCTGTGT GGCCTGCGGC GAATGCGTGC AGGCCTGCCC CACGGGCGCG CTGAGTCCCA AGACCCACAT CGGCCCGCAG AAGGTGGACC GCACGGTGGA CTCGGTCTGC CCGTTCTGCG GCGTGGGCTG CCTCGTCACC TACCACGTGA AGGACGAAAA AATCGTGCGC GTGGACGGCC GTGACGGCCC GGCCAACCGG GGGCGCCTGT GCGTGAAGGG GCGCTTTGGC TTTGACTACG CGCACCACCC GCAGCGCCTC ACGGTGCCGC TGGTGCGCAA ACCTGGCGTT CCCAAGGACT GGCAGGGCGC TGTGAGGCCC GAGGACTGGC GCGAGGTCTT CCGCGAGGCG ACCTGGGACG AGGCACTGCA GCGCGCCGCA GGCGGACTGC GTGCGCTGCG CGATGCGCAC GGCCCCAAGG CGCTGGCGGG GTTTGGCTCG GCCAAGGGCA GCAACGAAGA GGCCTACCTG TTCCAGAAGC TGGTGCGCAC CGGCTTTGGC AGCAACAACG TGGACCATTG CACGCGCCTG TGTCACGCCT CCAGCGTGGC GGCGCTGCTG GAGGGCGTGG GCTCGGGCGC GGTGAGCAAC CCGGTGAGCG ACGTGGCGCA TGCCGAGGTT ATTCTGGTGA TCGGCTCCAA CCCCACATCC AACCACCCCG TGGCCGCCAC ATGGATCAAG AATGCCGCCC GGCGGGGTGC AAAGATCGTG CTGGCCGACC CGCGCGTGAC CGACATCGGG CGCCACGCCT GGCGCGTGCT GCAGTTCCGG CCCGACACCG ATGTGGCCGT GCTCAACGCG CTGATCCACA CGGTGATCGA GGAGGGCCTG GCCGACGAGG CCTTCATCCG TGACCGTGCG CTCAACTACG AGGCCCTGCG AGAGAACGTG CGCGCCTACA GCCCTGAGGC CATGGCGCCG CTCTGCGGCA TTCCCGCGGA CACGCTGCGT GAGGTGGCGC GCGCGTTCGC CACGGCCAAG GCGTCGATGG TCCTCTGGGG CATGGGCGTG AGCCAGCATG TGCATGGCAC CGACAACGTG CGCTGCCTGA TTGCCCTCGC CACCGTCACC GGGCAAATCG GCCGGCCCGG CACGGGCCTG CACCCGCTGC GCGGCCAGAA CAACGTGCAG GGCGCAAGCG ACGCGGGCCT GATCCCGATG ATGTTCCCCA ACTACCAGCG CGTGGACAAC GCCGAGGCGC GCGCGTGGTT CGAGCAGTTC TGGGGCTTGC CGCTGGACGA CGCGCCGGGT TACACCGTGG TCGAGATCAT GCACAAGGCC CTGGCCCATG AGGCCGACCC GCACAAGGTG CGCGGCATGT ACATCATGGG CGAGAACCCG GCCATGAGCG ACCCTGACCT GAACCATGCG CGCCAGGCGC TCGCGAGCCT GGCGCACCTC GTGGTGCAGG ACATCTTCTT GACCGAAACC GCCTGGCTGG CCGACGTGGT GCTGCCCGCC AGCGCCTGGC CCGAGAAGAC CGGCAGCGTG AGCAACACCG ACCGCATGGT GCAGCTGGGC CGGCGCGCGC TCACGCCACC GGGTGACGCG CGGCCCGATC TGTGGATCGT GCAACAGATG GCGAAGGGCA TGGGCCTGGC CTGGGACTAC GAGGGCGCAG AAGCTGGCGT GGCTGCCGTC TACGAGGAAA TGCGCCAGGC CATGGCCGGC GCCATCGCGG GTATCAGTTG GGACCGGCTA GAGCGCGAGT CCAGCGTGAC CTACCCCTGC CTCTCGGCCG AGGACCCGGG CCAGCCCATC GTATTCACCG AGCGCTTTCC CACCCCCACG GGCCGTGTGA CGTTGGTGCC GGCCGACATC ATCCCGGCCG ACGAGCGGCC CGATGCCCAC TACCCGCTGG TGCTCATCAC GGGCCGCCAG TTGGAGCACT GGCATACGGG CAGCATGACG CGGCGATCGG CCGTGCTGGA CGCCATCGAG CCGCACGCCA CGGCCTCGCT GCACGGCGAC GAACTGGCGC GCCTAGGGCT GGCGCCGGGC GACTGGGCGG CGATCCGCTC GCGCCGTGGT GCCGTGCAGG TGCGTGTGCG CCGCGACGAC GGCACGCCGC GCGGTGCGGT GTTCATGCCG TTTGCCTATG TGGAAGCGGC GGCCAACCTG CTCACCAATG CGGCGCTGGA CCCGTTCGGG AAGATCCCCG AGTTCAAGTA TTGCGCCGTG GCCGTGCGGG GCATTGCGCC GCCGCAGGGG CAGGGCGGCT GA
|
Protein sequence | MSAAPHTGAG AVPFELDGEL LQALPGETLW QAARRYGVQI PHLCHTDGLR PDGNCRACVV EIEGERTLAP SCCRAPTPGM RVHAQSPRAL KSQQMVLELL LSDMPETGYK WNDATKIIAD SACQESVGAQ FDQVPTPPGQ HGELSAWADA LGVAVRPALA ALRRAQPAPD LSHPAMAVHL DACIQCNRCV RACREEQVND VIGYALRGAD SKIVFDLDDP MGASTCVACG ECVQACPTGA LSPKTHIGPQ KVDRTVDSVC PFCGVGCLVT YHVKDEKIVR VDGRDGPANR GRLCVKGRFG FDYAHHPQRL TVPLVRKPGV PKDWQGAVRP EDWREVFREA TWDEALQRAA GGLRALRDAH GPKALAGFGS AKGSNEEAYL FQKLVRTGFG SNNVDHCTRL CHASSVAALL EGVGSGAVSN PVSDVAHAEV ILVIGSNPTS NHPVAATWIK NAARRGAKIV LADPRVTDIG RHAWRVLQFR PDTDVAVLNA LIHTVIEEGL ADEAFIRDRA LNYEALRENV RAYSPEAMAP LCGIPADTLR EVARAFATAK ASMVLWGMGV SQHVHGTDNV RCLIALATVT GQIGRPGTGL HPLRGQNNVQ GASDAGLIPM MFPNYQRVDN AEARAWFEQF WGLPLDDAPG YTVVEIMHKA LAHEADPHKV RGMYIMGENP AMSDPDLNHA RQALASLAHL VVQDIFLTET AWLADVVLPA SAWPEKTGSV SNTDRMVQLG RRALTPPGDA RPDLWIVQQM AKGMGLAWDY EGAEAGVAAV YEEMRQAMAG AIAGISWDRL ERESSVTYPC LSAEDPGQPI VFTERFPTPT GRVTLVPADI IPADERPDAH YPLVLITGRQ LEHWHTGSMT RRSAVLDAIE PHATASLHGD ELARLGLAPG DWAAIRSRRG AVQVRVRRDD GTPRGAVFMP FAYVEAAANL LTNAALDPFG KIPEFKYCAV AVRGIAPPQG QGG
|
| |