Gene Dret_1422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1422 
Symbol 
ID8419251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1652631 
End bp1653848 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content59% 
IMG OID645037997 
ProductGTP cyclohydrolase II 
Protein accessionYP_003198287 
Protein GI258405545 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.13009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.145821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTTT GTACGCCAGA GGAAGCCATT GAAGAAATCC GTCAAGGCCG CATGCTCATC 
CTGGTCGACG ACGAAGATCG CGAAAACGAG GGCGACCTGA CCATTGCCGC AGAACATATC
ACGCCTGAGG CCATCAATTT CATGGCCACC CACGGACGGG GGCTCATCTG CCTTGCCCTG
GCCCCGGAAT GGGTCGACCG CCTTGAACTG CCGATGCAGC CCCGGCGCAA CGAATCCAAA
TTCGGCACCG CCTTCACCGT CTCCATTGAG GCCGCTCAGG GAGTGACCAC CGGGATTTCC
GCACACGACC GGGCCACAAC CATTCAGGCT GCGGTCCAGG AAGATGTCAC CCCGGACGAT
ATCAGCACAC CGGGACATAT CTTCCCTTTG CGCGCCCGCA ACGGCGGAGT CCTGGTCCGC
GCCGGTCAGA CCGAGGGCAG TGTCGATCTG AGCAAACTGG CCGGATGCAA ACCAGCCTCG
GTGATCTGCG AGATCATGCG TGAAGACGGC ACCATGGCCC GGATGCCGGA TCTGGAGGTC
TTTGCCGAAA AGCACGGCCT GAAGATCGCG ACCATTGAAA GTCTGATCCG CTACCGCTCC
AAATTCGATT CCCTGGTCAC CGCCGTGGGC GAAGCGCAAT TGCCGACGAA ATTCGGTCAC
TTCCGGGCTG TCGCCTACGA GAGCGAGATC GAAGACCATA CCCATTTGGC CCTGGTCAAA
GGGGAAATCC ACGAAGACGA GCCGATCCTG GTCCGGGTCC ACAGCCAATG CCTGACCGGG
GACATCTTCG GCAGCCTCCG TTGCGACTGT GGCAATCAAC TCCAGAACGC CATGCGCATG
ATCGAAGAGG AAGGCAACGG CATCCTGCTG TATATGCGCC AGGAAGGCCG GGGGATCGGC
CTGGGCAACA AAATCCGCGC TTACCACCTC CAGGACCAGG GCAAGGATAC GGTCGAAGCC
AATCTCGAAC TCGGCTTTGA ACCCGACCTC CGCGATTACG GTATCGGGGC CCAGATCCTT
GTCGATCTTG GGGTCCAGAA AATGCGTTTG ATGACCAACA ATCCCAAGAA GGTCGTTGGC
CTGCAGGGTT ACGGGCTGGA GATTACGGAC CGTGTCGGCC TGGAGACCAC GCCCTGTGAG
GAAAACCTCT GCTACCTGCG CACCAAGCAG GAGAAAATGG GCCATATGTT CACTCAGGAA
TACGGCACAG GGGAATAA
 
Protein sequence
MPVCTPEEAI EEIRQGRMLI LVDDEDRENE GDLTIAAEHI TPEAINFMAT HGRGLICLAL 
APEWVDRLEL PMQPRRNESK FGTAFTVSIE AAQGVTTGIS AHDRATTIQA AVQEDVTPDD
ISTPGHIFPL RARNGGVLVR AGQTEGSVDL SKLAGCKPAS VICEIMREDG TMARMPDLEV
FAEKHGLKIA TIESLIRYRS KFDSLVTAVG EAQLPTKFGH FRAVAYESEI EDHTHLALVK
GEIHEDEPIL VRVHSQCLTG DIFGSLRCDC GNQLQNAMRM IEEEGNGILL YMRQEGRGIG
LGNKIRAYHL QDQGKDTVEA NLELGFEPDL RDYGIGAQIL VDLGVQKMRL MTNNPKKVVG
LQGYGLEITD RVGLETTPCE ENLCYLRTKQ EKMGHMFTQE YGTGE