Gene Dole_3133 details


Gene Information

Locus tag: Dole_3133
Symbol:
ID: 5695993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3760488
End bp: 3763508
Gene Length: 3021 bp
Protein Length: 1006 aa
Translation table: 11
GC content: 53%
IMG OID: 641265750
Product: von Willebrand factor type A
Protein accession: YP_001531013
Protein GI: 158523143
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4548] Nitric oxide reductase activation protein
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCAG TCAACCATCC CGGCGACCGG CTGCCGGAAG CGGCGGATAT CGCCTTTTTT 
TCCAGTATCT CTAAAGAATG CGGCCAGGCG TTTGCAAGCG CGGAGAAACA GGCCCGGGCG
CTTCTTTCTG AAAGCGCGTA TGCCACCTGG GTGTCACTTG CCAGAAAGAT TCACCGGTCA
TTCCCGGATA CAGACGGCCC GGTGCGGGCG TACCTTGATT CCAGCCGGCC GTTTTTTTGC
GAGGACGGGC TTGGCTACCT GAAAAACTGG GTTACCGAGA GTATCAAAAT CGGATCATGG
TCGACGGCCT GCGCCAAAGA CTTTTTAATG GCCACCCCGG CTTTTCTGGC GCACGCCCGT
TTTGCCAAGA TCAACCAGCT GGCGTCGGAC ATAAAGTACA TTCTTGACGC GGAAAACGGG
GGCGAAGCGA CCGCGACCGC GTTTATAAAA ACGTCGGCAA CCATCCTGCG GTATCTGTCA
CCAAGAGTCT ACAAGATATG GAAAGAGAGC GGCTTCCGTA TTTTAAGGCA AAACAGGGAT
AAGGGGACCC AATATTTTTC CATGGAACCG GAAGGGCTTG ACCGGCTTTA TCTTTCTGAA
ACCACAAAAA TTTTTAAGAT TACAGCCATC GCCTTTAACT CGACGCCTGA AAAGGCCGGT
GCTTTCTATG AGACGCTCCC CAACCGCATT CTGCGAATAA ATCCCAACCT GCGCGACAAA
ATTCTGGAAA AGATTCTTGA GATGGCATCC GGAAGACCGG ATGAAATCAT CGAGGACATG
AACGCTATGG CCCTGTCGCT TGGCTCCTTT TCCAACCCGG TCCAGCAGAC GATCTTCGAT
CTGGGCAAAC AACTGGATGA AATTTCAAAA AAGGCGTTCC GGGCCTATTA TCAAAACGTG
AAGCATGTCC TTGAAAATAT TCCTGTTTAC TTTCTGGTCA ACTGGGTGAG CCGGGGAATG
GACCTCCTTC TTGAAAACAA AAAAGAGGGC GTCACCTATT TTGCAATGGA AAGCCCGGAA
GCCGGGGCCG AACTGGTTAA ATGGGGAAGC GCCGCCTTTC TCGAACAGCA CCGGGAAATG
CTCTCACTGT TCTGCCACGC CCTGTGCGGG AAGAAGGTCC GTATTCGAAG CAATGATGAG
ATGGCCGAAT CAGAGAGGAA CCGGCTGGGT CTTATAGCGG AGGACACGGG CTTTATTTTT
TTTCTTTCTT CCTATGTGGC GGAAGAAGAT AACGCGGCGG CCAACATCCG TTATTACAAG
ACGGCTGCCG CGCTTAAGGC GGGATATATC GAGTTCGGCA CCCTGGCCCC TGAATTCGCG
GGCATATGGC AGCTGCTGGA GTCGTTTCCT GACAGGGAGT TGGCCCTTGA TATTTTTCAC
ATCCTTGAGG ACGGCAGGAT TTTCTATAAT TTGAAAAAAA ACTATCCCGG TCTTTCTCCG
GAGATTGAGC GCACAATTGA AAACGCCCTT TTAAAAAGGG ATGTCCCCCG GGACGACCTT
TTTGGCGCGG CCCTTGAATT GCTCTTACGG CTATCCCTCG GGTATGCTGC TGATATAAAC
ATGGACGCCC CCTTTTCCAA GCCCCTTGCC GGTGTTTATG CCGACCTGAA AGACCATCTC
GCGGCGTTTC CCGCGCAAGC AGAAACCGTT CTTGATTCCT ACACGATAAC GGCACGGGTC
TATGACACAC TCAGCGCACT GGTCCGGCAA AAGTCCCGAA AGGCGGCTTT GCCATTGCCG
CTTTGCATGA ATAAAGAACC GGAGGAAACA GAAGGAACCG GACCCATGGT AATGCCCCCG
AATACAATTG TGGAGGGGGA CGGTTCAGAA ACAGGGACGG ATATCACCCT GACACCGGAG
GAGCTGGAAC GGCTCCTTGA CATGGCCCAG GACATCACCC TGTTGAGCAT GCTTACGCCG
GTGCCTGCGG CGAACAGGTT CTACCTTTCC GACCTTGATA ATTTTACGGT AAAAGACGGC
GGCGATGAAG CCCGGGACCC GGGCAGTGTT GATATAAAAC AGGGGGTGAC CACCGGGAAG
ACGGTTGGTA AAGCCGGGAC CGGCAAGAAA AAATACTATT ATGACGAGTG GGATTTTCTT
GCCAGAGAAT ACAGGACAAA ATGGTGCTGC CTCAGGGAAA AGGAGCCCCG CCAGAGCGAC
CCGGACATGT ATCACAGGAT TTACGCGGAA TACGGCGACC TTATCCGCAA AACGCGGGCC
CAGTTCCAGC GGATCCGGCC GGCGTCGTTG GATATTATCC ACAATGTGGA CCAGGGGGAT
GAGATCGATC TTACCGCCCT GATTCGACAC GTTGTTGATA AAAAAGCGGG GGCCGTTCCT
TCGGACAGGG TGTTCTGCAG AAAAGACAAA AAGATACGGC ACATGTCAAC GCTACTGCTG
ATCGACATGA GCGCCTCCAC GGAAGAAACC GCGCCAGAGG TATCGGCAGA AGATTCGCAG
GATAAAAAAG GGGGAAAATC ATCCCGCGAC GACAAGCGGG TGATTGACAT TGAGAAGGAG
AGCCTGATTG TCATGTCCGA AGCGCTGGAC GCGCTGGGCG ACCAGTACGC CATGTACGGG
TTTTCCGGTC ATGGCAGGGA ACACGTGGAC TACTATGTGA TCAAGTCCTT TGACGAGTCC
AACACGGAAA AGGTGAAAAT GCGCATCTGC GGCATTGAGC CCAGGCAGAG TACGCGCATG
GGCACCGCTA TCCGCCACGC CGTTTCCAAA CTCAGCAACC GTGAGGCGGA CCACCGGTTG
CTGATTCTCC TGAGCGACGG ATTTCCCCAG GACCTTGACT ACGGCGAAGA CAGGAACTCA
CGGGAGTACG GGCTTAACGA CACCATGATG GCCTTTATCG AAGCCAAACG GCTGGGCATC
AAGCCCTTCT GCATAACCAT CGACCAGTCG GGAAACGACT ACCTGAAAAA AATGTGCGCC
CCGGAAGAGT ATTTGATCAT CAAAGATATT GCCATGCTCC CGGAACTGTT GCCGGGAATC
GTTGAGTCGC TGATGGGTTG A
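
The gene sequence above is printed in blocks of 10 bases, so the whitespace has to be removed before the sequence can be measured or compared with the record's fields. The following is a minimal Python sketch of that cleanup, together with the coordinate arithmetic behind the gene length (end minus start plus one) and a simple GC-fraction calculation. The function names clean_sequence, gc_fraction and span_length are illustrative assumptions and are not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


# First line of the gene sequence, copied from the record.
first_line = "ATGAGTTCAG TCAACCATCC CGGCGACCGG CTGCCGGAAG CGGCGGATAT CGCCTTTTTT"

print(len(clean_sequence(first_line)))   # 60 bases on the first line
print(span_length(3760488, 3763508))     # 3021, matching the stated gene length
```

Passing the full cleaned sequence to gc_fraction gives a value that can be compared with the rounded GC content field of the record.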
 
Protein sequence
MSSVNHPGDR LPEAADIAFF SSISKECGQA FASAEKQARA LLSESAYATW VSLARKIHRS 
FPDTDGPVRA YLDSSRPFFC EDGLGYLKNW VTESIKIGSW STACAKDFLM ATPAFLAHAR
FAKINQLASD IKYILDAENG GEATATAFIK TSATILRYLS PRVYKIWKES GFRILRQNRD
KGTQYFSMEP EGLDRLYLSE TTKIFKITAI AFNSTPEKAG AFYETLPNRI LRINPNLRDK
ILEKILEMAS GRPDEIIEDM NAMALSLGSF SNPVQQTIFD LGKQLDEISK KAFRAYYQNV
KHVLENIPVY FLVNWVSRGM DLLLENKKEG VTYFAMESPE AGAELVKWGS AAFLEQHREM
LSLFCHALCG KKVRIRSNDE MAESERNRLG LIAEDTGFIF FLSSYVAEED NAAANIRYYK
TAAALKAGYI EFGTLAPEFA GIWQLLESFP DRELALDIFH ILEDGRIFYN LKKNYPGLSP
EIERTIENAL LKRDVPRDDL FGAALELLLR LSLGYAADIN MDAPFSKPLA GVYADLKDHL
AAFPAQAETV LDSYTITARV YDTLSALVRQ KSRKAALPLP LCMNKEPEET EGTGPMVMPP
NTIVEGDGSE TGTDITLTPE ELERLLDMAQ DITLLSMLTP VPAANRFYLS DLDNFTVKDG
GDEARDPGSV DIKQGVTTGK TVGKAGTGKK KYYYDEWDFL AREYRTKWCC LREKEPRQSD
PDMYHRIYAE YGDLIRKTRA QFQRIRPASL DIIHNVDQGD EIDLTALIRH VVDKKAGAVP
SDRVFCRKDK KIRHMSTLLL IDMSASTEET APEVSAEDSQ DKKGGKSSRD DKRVIDIEKE
SLIVMSEALD ALGDQYAMYG FSGHGREHVD YYVIKSFDES NTEKVKMRIC GIEPRQSTRM
GTAIRHAVSK LSNREADHRL LILLSDGFPQ DLDYGEDRNS REYGLNDTMM AFIEAKRLGI
KPFCITIDQS GNDYLKKMCA PEEYLIIKDI AMLPELLPGI VESLMG