Gene Dret_0988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0988 
Symbol 
ID8418810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1162006 
End bp1163463 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content59% 
IMG OID645037557 
Productinosine-5'-monophosphate dehydrogenase 
Protein accessionYP_003197854 
Protein GI258405112 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0516] IMP dehydrogenase/GMP reductase 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00325513 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000530591 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACAAGA TTACCGGAAC CGGGCTGACG TTTGATGATG TGTTGCTGCT TCCCCGCTAT 
TCCGACGTCC TGCCGGACAC GGTTGATGTC GGGACCCAAC TCACACCACA GATCCGCTTG
AATGTGCCGC TTTTGAGCGC GGCCATGGAC ACCGTGACCG AATCCCGGAT GGCCATCTCC
ATGGCCCGGG CCGGGGGAAT CGGCATCATC CACAAGAATA TGACCATTGA TCAGCAGCGG
CTGGAGGTGG AGAAAGTCAA AAAATCCGAA AGCGGCATGA TCGTTTCCCC GGTCACCGTG
GAACCGGACT ATACCATCGC TCAGGCCCTG GATATCATGT CGGAATATCG CATTTCAGGT
TTGCCAGTGG TCACCGAGGG CCATCTGGTC GGGATCGTGA CCAACCGTGA CGTCCGTTTC
GTCAAGGATC TGCAGACCAC GGTCGCCGAC GTCATGACCA GCAAGAATCT GGTCACCGTG
CCTGTGGGCA CGACCATGGA AGAGGCCAAG AAGCACCTGC ACGCCAGCCG GATTGAAAAG
CTGCTCGTCG TGGATGAAGA CAATAATCTG CGGGGCCTGA TCACCATCAA GGACATCGAA
AAGGTCAAGA AATATCCCGA TTCCTGCAAA GATGAACTCG GTCGTCTGCG GGTCGGCGCC
GCCCTGGGCG CTGGGGGCGA CCGGGATGAG CGCGCGGCAG CCCTTCTGGC GGCGGGCGTT
GATGTCCTTG TCGTGGATTC GGCTCACGGC CACAGCAAGA ATATTATCGA GGCGGTCAGG
ACGTTGCGGC GCAGCCATCC GGACTGCCAG CTTATAGCCG GCAACGTGGC CACCTATACC
GGAGCTTCGG CCCTGCTCGA GGCTGGAGCC GACGCGGTCA AAGTCGGCAT CGGGCCGGGC
TCGATTTGTA CCACCCGCGT GGTGGCCGGT GTTGGAGTGC CCCAGATCTC GGCGATCATG
GAAGTCTCCA AGGCCTGCAA TGAGCACGGC AAATGTTTGA TCGCCGACGG TGGTGTGAAA
TTTTCCGGCG ATGTCATCAA GGCCCTGGCC GCCGGAGCCG ATTCGGTCAT GATGGGCTCC
ATGCTGGCCG GGACAGAAGA AAGCCCGGGG GAGACCATCC TCTACCAGGG TCGGAAGTAC
AAAATCTACC GCGGTATGGG ATCCATTGAT GCCATGAAGG ACGGGAGTTC GGACCGCTAT
TTCCAGGACG ACTCCAAAAA GCTCGTTCCG GAAGGCATTG TCGGCCGAGT GCCCTTCAAA
GGCTCGGCCA CAGAAACCGT CTATCAGCTT ATGGGCGGCA TGCGCTCTGG CATGGGCTAT
GTCGGGTGCG GCACGGTCAA AGAACTCAAG GAAGAGGCCC AGTTCGTCCA GATCTCCCCG
GCCGGACTGC GGGAAAGCCA TGTCCACGAT GTGGTTATCA CCAAAGAGGC GCCCAACTAC
CGCATCGAGT CGCCCTGA
 
Protein sequence
MDKITGTGLT FDDVLLLPRY SDVLPDTVDV GTQLTPQIRL NVPLLSAAMD TVTESRMAIS 
MARAGGIGII HKNMTIDQQR LEVEKVKKSE SGMIVSPVTV EPDYTIAQAL DIMSEYRISG
LPVVTEGHLV GIVTNRDVRF VKDLQTTVAD VMTSKNLVTV PVGTTMEEAK KHLHASRIEK
LLVVDEDNNL RGLITIKDIE KVKKYPDSCK DELGRLRVGA ALGAGGDRDE RAAALLAAGV
DVLVVDSAHG HSKNIIEAVR TLRRSHPDCQ LIAGNVATYT GASALLEAGA DAVKVGIGPG
SICTTRVVAG VGVPQISAIM EVSKACNEHG KCLIADGGVK FSGDVIKALA AGADSVMMGS
MLAGTEESPG ETILYQGRKY KIYRGMGSID AMKDGSSDRY FQDDSKKLVP EGIVGRVPFK
GSATETVYQL MGGMRSGMGY VGCGTVKELK EEAQFVQISP AGLRESHVHD VVITKEAPNY
RIESP