Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_3133 |
Symbol | |
ID | 5695993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 3760488 |
End bp | 3763508 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641265750 |
Product | von Willebrand factor type A |
Protein accession | YP_001531013 |
Protein GI | 158523143 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4548] Nitric oxide reductase activation protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCAG TCAACCATCC CGGCGACCGG CTGCCGGAAG CGGCGGATAT CGCCTTTTTT TCCAGTATCT CTAAAGAATG CGGCCAGGCG TTTGCAAGCG CGGAGAAACA GGCCCGGGCG CTTCTTTCTG AAAGCGCGTA TGCCACCTGG GTGTCACTTG CCAGAAAGAT TCACCGGTCA TTCCCGGATA CAGACGGCCC GGTGCGGGCG TACCTTGATT CCAGCCGGCC GTTTTTTTGC GAGGACGGGC TTGGCTACCT GAAAAACTGG GTTACCGAGA GTATCAAAAT CGGATCATGG TCGACGGCCT GCGCCAAAGA CTTTTTAATG GCCACCCCGG CTTTTCTGGC GCACGCCCGT TTTGCCAAGA TCAACCAGCT GGCGTCGGAC ATAAAGTACA TTCTTGACGC GGAAAACGGG GGCGAAGCGA CCGCGACCGC GTTTATAAAA ACGTCGGCAA CCATCCTGCG GTATCTGTCA CCAAGAGTCT ACAAGATATG GAAAGAGAGC GGCTTCCGTA TTTTAAGGCA AAACAGGGAT AAGGGGACCC AATATTTTTC CATGGAACCG GAAGGGCTTG ACCGGCTTTA TCTTTCTGAA ACCACAAAAA TTTTTAAGAT TACAGCCATC GCCTTTAACT CGACGCCTGA AAAGGCCGGT GCTTTCTATG AGACGCTCCC CAACCGCATT CTGCGAATAA ATCCCAACCT GCGCGACAAA ATTCTGGAAA AGATTCTTGA GATGGCATCC GGAAGACCGG ATGAAATCAT CGAGGACATG AACGCTATGG CCCTGTCGCT TGGCTCCTTT TCCAACCCGG TCCAGCAGAC GATCTTCGAT CTGGGCAAAC AACTGGATGA AATTTCAAAA AAGGCGTTCC GGGCCTATTA TCAAAACGTG AAGCATGTCC TTGAAAATAT TCCTGTTTAC TTTCTGGTCA ACTGGGTGAG CCGGGGAATG GACCTCCTTC TTGAAAACAA AAAAGAGGGC GTCACCTATT TTGCAATGGA AAGCCCGGAA GCCGGGGCCG AACTGGTTAA ATGGGGAAGC GCCGCCTTTC TCGAACAGCA CCGGGAAATG CTCTCACTGT TCTGCCACGC CCTGTGCGGG AAGAAGGTCC GTATTCGAAG CAATGATGAG ATGGCCGAAT CAGAGAGGAA CCGGCTGGGT CTTATAGCGG AGGACACGGG CTTTATTTTT TTTCTTTCTT CCTATGTGGC GGAAGAAGAT AACGCGGCGG CCAACATCCG TTATTACAAG ACGGCTGCCG CGCTTAAGGC GGGATATATC GAGTTCGGCA CCCTGGCCCC TGAATTCGCG GGCATATGGC AGCTGCTGGA GTCGTTTCCT GACAGGGAGT TGGCCCTTGA TATTTTTCAC ATCCTTGAGG ACGGCAGGAT TTTCTATAAT TTGAAAAAAA ACTATCCCGG TCTTTCTCCG GAGATTGAGC GCACAATTGA AAACGCCCTT TTAAAAAGGG ATGTCCCCCG GGACGACCTT TTTGGCGCGG CCCTTGAATT GCTCTTACGG CTATCCCTCG GGTATGCTGC TGATATAAAC ATGGACGCCC CCTTTTCCAA GCCCCTTGCC GGTGTTTATG CCGACCTGAA AGACCATCTC GCGGCGTTTC CCGCGCAAGC AGAAACCGTT CTTGATTCCT ACACGATAAC GGCACGGGTC TATGACACAC TCAGCGCACT GGTCCGGCAA AAGTCCCGAA AGGCGGCTTT GCCATTGCCG CTTTGCATGA ATAAAGAACC GGAGGAAACA GAAGGAACCG GACCCATGGT AATGCCCCCG AATACAATTG TGGAGGGGGA CGGTTCAGAA ACAGGGACGG ATATCACCCT GACACCGGAG GAGCTGGAAC GGCTCCTTGA CATGGCCCAG GACATCACCC TGTTGAGCAT GCTTACGCCG GTGCCTGCGG CGAACAGGTT CTACCTTTCC GACCTTGATA ATTTTACGGT AAAAGACGGC GGCGATGAAG CCCGGGACCC GGGCAGTGTT GATATAAAAC AGGGGGTGAC CACCGGGAAG ACGGTTGGTA AAGCCGGGAC CGGCAAGAAA AAATACTATT ATGACGAGTG GGATTTTCTT GCCAGAGAAT ACAGGACAAA ATGGTGCTGC CTCAGGGAAA AGGAGCCCCG CCAGAGCGAC CCGGACATGT ATCACAGGAT TTACGCGGAA TACGGCGACC TTATCCGCAA AACGCGGGCC CAGTTCCAGC GGATCCGGCC GGCGTCGTTG GATATTATCC ACAATGTGGA CCAGGGGGAT GAGATCGATC TTACCGCCCT GATTCGACAC GTTGTTGATA AAAAAGCGGG GGCCGTTCCT TCGGACAGGG TGTTCTGCAG AAAAGACAAA AAGATACGGC ACATGTCAAC GCTACTGCTG ATCGACATGA GCGCCTCCAC GGAAGAAACC GCGCCAGAGG TATCGGCAGA AGATTCGCAG GATAAAAAAG GGGGAAAATC ATCCCGCGAC GACAAGCGGG TGATTGACAT TGAGAAGGAG AGCCTGATTG TCATGTCCGA AGCGCTGGAC GCGCTGGGCG ACCAGTACGC CATGTACGGG TTTTCCGGTC ATGGCAGGGA ACACGTGGAC TACTATGTGA TCAAGTCCTT TGACGAGTCC AACACGGAAA AGGTGAAAAT GCGCATCTGC GGCATTGAGC CCAGGCAGAG TACGCGCATG GGCACCGCTA TCCGCCACGC CGTTTCCAAA CTCAGCAACC GTGAGGCGGA CCACCGGTTG CTGATTCTCC TGAGCGACGG ATTTCCCCAG GACCTTGACT ACGGCGAAGA CAGGAACTCA CGGGAGTACG GGCTTAACGA CACCATGATG GCCTTTATCG AAGCCAAACG GCTGGGCATC AAGCCCTTCT GCATAACCAT CGACCAGTCG GGAAACGACT ACCTGAAAAA AATGTGCGCC CCGGAAGAGT ATTTGATCAT CAAAGATATT GCCATGCTCC CGGAACTGTT GCCGGGAATC GTTGAGTCGC TGATGGGTTG A
|
Protein sequence | MSSVNHPGDR LPEAADIAFF SSISKECGQA FASAEKQARA LLSESAYATW VSLARKIHRS FPDTDGPVRA YLDSSRPFFC EDGLGYLKNW VTESIKIGSW STACAKDFLM ATPAFLAHAR FAKINQLASD IKYILDAENG GEATATAFIK TSATILRYLS PRVYKIWKES GFRILRQNRD KGTQYFSMEP EGLDRLYLSE TTKIFKITAI AFNSTPEKAG AFYETLPNRI LRINPNLRDK ILEKILEMAS GRPDEIIEDM NAMALSLGSF SNPVQQTIFD LGKQLDEISK KAFRAYYQNV KHVLENIPVY FLVNWVSRGM DLLLENKKEG VTYFAMESPE AGAELVKWGS AAFLEQHREM LSLFCHALCG KKVRIRSNDE MAESERNRLG LIAEDTGFIF FLSSYVAEED NAAANIRYYK TAAALKAGYI EFGTLAPEFA GIWQLLESFP DRELALDIFH ILEDGRIFYN LKKNYPGLSP EIERTIENAL LKRDVPRDDL FGAALELLLR LSLGYAADIN MDAPFSKPLA GVYADLKDHL AAFPAQAETV LDSYTITARV YDTLSALVRQ KSRKAALPLP LCMNKEPEET EGTGPMVMPP NTIVEGDGSE TGTDITLTPE ELERLLDMAQ DITLLSMLTP VPAANRFYLS DLDNFTVKDG GDEARDPGSV DIKQGVTTGK TVGKAGTGKK KYYYDEWDFL AREYRTKWCC LREKEPRQSD PDMYHRIYAE YGDLIRKTRA QFQRIRPASL DIIHNVDQGD EIDLTALIRH VVDKKAGAVP SDRVFCRKDK KIRHMSTLLL IDMSASTEET APEVSAEDSQ DKKGGKSSRD DKRVIDIEKE SLIVMSEALD ALGDQYAMYG FSGHGREHVD YYVIKSFDES NTEKVKMRIC GIEPRQSTRM GTAIRHAVSK LSNREADHRL LILLSDGFPQ DLDYGEDRNS REYGLNDTMM AFIEAKRLGI KPFCITIDQS GNDYLKKMCA PEEYLIIKDI AMLPELLPGI VESLMG
|
| |