Gene Dret_2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2110 
Symbol 
ID8419960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2399334 
End bp2401283 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content61% 
IMG OID645038703 
Productputative molybdopterin biosynthesis protein MoeA/LysR substrate binding-domain-containing protein 
Protein accessionYP_003198972 
Protein GI258406230 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0934426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCAG AACGCACTAT TTATCTTCAA ATGCAACCGA TTGAACAGGC GCTGGAGACC 
GTGCAGCAGG AGTTGCGCCC GGCAACACTC CTTGGTCATG AACTTGTGGC CACCGAAGAA
GCAGTCGGCC GGGTGACGGC TGAGGCGGTC TATGCCCAAT ACTCCTCACC GACCTTTCAT
GGCGCTGCAA TGGATGGTAT CGCGGTCCGG GCCGCAGACA CCTTTGCCGC CAGAGAGGGC
GCGCCCGTCC ATTTGGAGCG CAACACCGGC TTCGTCGAGG TCAACACCGG CAATCCGTTG
CCGGACGGCA TGAACGCGGT GATCATGGTC GAGCATATCG ATTTTGTGGA TTCTGACACC
GTCGCCATCG AGGCCCCGGC CTTTCCTTGG CAACATGTGC GGCGCATTGG CGAGGACATT
GTTGCCACGG AACTTCTCCT GCCTCAAAAC CACACCTTGA CGCCCTACGA TACCGGCGCC
CTCTTGAGCG CCGGTATCTG GGAACTCGCT GTCTGGCAAC AGCCTGTTGT CGAGGTTATC
CCCACCGGGG ACGAGGTCCT CGATTTTCGG CAGCGCCCCG AGCCACGACC CGGTCAGGTC
GTGGAAAGCA ATTCCCAGGT CCTGGCTGGC CTGGCCCGAC AATGGGGCGC TGAAGTCCGG
TGTCAGCCGC CGGTGGCTGA TGATGAAAGC ATCCTTTTGC AGGCTGTCCA GGACGCTCTC
GACGGGCCAG CCCATGTGGT GGTTATCGGT GCCGGGTCCT CGGCCGGGAG CAAGGATTTC
AGCCGCCGGG TCATGGAACA ATGCGGCAGG ATTCTCGTTC ATGGCATCAC GGCCATGCCC
GGCAAACCTT CCCTGCTTGG TGTGGCCGCG AACGGGAAAC TCCTGGTGGG GGCTCCCGGG
TATCCGGTCA GTGCCGTGGT CTGTTACGAG CAATTGCTCC AGCCGCTTTT GGCGCAAATG
CAACACAAGC CGGTGAACCG GCGACCGGTT ATCGAGGTGG AAATCAACCG GAAATTGCCG
TCCAAACTCG GCGTGGACGA GTTCGTCCGG CTGGCCATCG GCAAGGTGGG CGATAAATGG
GTGGGAACAC CTCTGGCCCG GGGAGCAGGC ATGATTACCA CGTTGACCCG AGCCCAGGGA
GTGGCCCGGA TTCCCACGGA AAGCGAGGGC GTTGAGGCGG GGCAGACCGT TCGAGCCGAA
CTGTTTGTCC CGGCCGAGGA GGTCGAGCGG GTACTCGTTG CCGTGGGCAG TCACGACAAC
ACCCTGGATC TCTTGGCCAA TGCCCTGCAG GGGCTCAAGC ACCCCATCGG ACTGGCCTCC
AGTCATGTCG GCAGTATGGG GGGATTGACC GCCCTGCAAA ATGGTTCGGT CCATATCGCC
GGAGCCCATC TGTACGACCC GGAAAGCGGG GATTACAACT TCCCGTTTCT CCAGCGCTAT
CTTGCCGATA TTCCGGTGAC CGCAGTGAAT CTGGCCATCA GGCACCAGGG GTTGATCGTT
CCCCGGGGCA ATCCCAAAGG GATCCAGGGA ATCCAGGACC TGACCCGCGA CGATGTCCGG
TTTATCAACC GGCAACGCGG GGCCGGCACC CGTATTCTCC TTGATGATCA TCTGCACCGC
GCGGGGCTCT CAGCCCGGGA GGTCAACGGC TATGGGCACG AAGAATTTAC GCATATGGCC
GTTGCGGTGA ATGTCCTGAG TGGTGCTGCG GATTGCGGCA TGGGGATATA TGCGGCGGCC
AAGGCCTTGG ATCTCGATTT CGTTCCTTTG GCCCGGGAGC GCTACGACCT GTTGATCCCC
ACAGCATATC TTGAGGACGA AAAAATCCGC TCTGTCTTGG ACCTGTTGGG GAAACCGGAG
TTTCAGCAGG AGATCGAATC CCTGGGCGGG TACGATACCC ATCTGACCGG ACAAGTCATG
CAGCCCGGCA TGGGCCTGGG CGAAGGCTAA
 
Protein sequence
MGSERTIYLQ MQPIEQALET VQQELRPATL LGHELVATEE AVGRVTAEAV YAQYSSPTFH 
GAAMDGIAVR AADTFAAREG APVHLERNTG FVEVNTGNPL PDGMNAVIMV EHIDFVDSDT
VAIEAPAFPW QHVRRIGEDI VATELLLPQN HTLTPYDTGA LLSAGIWELA VWQQPVVEVI
PTGDEVLDFR QRPEPRPGQV VESNSQVLAG LARQWGAEVR CQPPVADDES ILLQAVQDAL
DGPAHVVVIG AGSSAGSKDF SRRVMEQCGR ILVHGITAMP GKPSLLGVAA NGKLLVGAPG
YPVSAVVCYE QLLQPLLAQM QHKPVNRRPV IEVEINRKLP SKLGVDEFVR LAIGKVGDKW
VGTPLARGAG MITTLTRAQG VARIPTESEG VEAGQTVRAE LFVPAEEVER VLVAVGSHDN
TLDLLANALQ GLKHPIGLAS SHVGSMGGLT ALQNGSVHIA GAHLYDPESG DYNFPFLQRY
LADIPVTAVN LAIRHQGLIV PRGNPKGIQG IQDLTRDDVR FINRQRGAGT RILLDDHLHR
AGLSAREVNG YGHEEFTHMA VAVNVLSGAA DCGMGIYAAA KALDLDFVPL ARERYDLLIP
TAYLEDEKIR SVLDLLGKPE FQQEIESLGG YDTHLTGQVM QPGMGLGEG