Gene Csal_2584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2584 
Symbol 
ID4027120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2896656 
End bp2897816 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content65% 
IMG OID637967792 
ProductGTP cyclohydrolase II 
Protein accessionYP_574630 
Protein GI92114702 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTTT CATCCAAACA GGGCTTGGCT TCCATCGACG CCATCGTCGA GGACATCCGC 
CAAGGCAAGA TGGTCATTCT CATGGACGAT GAGGATCGCG AGAACGAAGG CGATATCATC
ATGGCCGCCG AGTGCGTCGA CGCCGAGCAC ATCAATTTCA TGGCGCGCTA CGCCCGCGGC
CTGATCTGCC TGCCGATGAC CCGGGAGCGT TGCGAACGCC TCGAACTGCC CTTGATGGTC
CGCGACAACG GCTCCGGTTT CGGTACCAAG TTCACCGTTT CCATCGAAGC CGCGCGCGGC
GTGTCCACGG GGATCTCCGC CTCCGACCGT GCTCGCACCG TACGTGCGGC GGCGGCGCGC
GACGCCGTGG CCGCGGATAT CGTCCAGCCA GGGCATATCT TTCCGCTGAT GGCCGAGCCG
GGTGGCGTGT TGCGGCGCGC CGGGCATACC GAGGCGGCAT GCGACCTGGC CGCCATGGCC
GGTTTCGAAC CGAGCGGCGT GATCTGCGAG GTCATGAACG ATGACGGCAG CATGGCGCGT
CGCGACGAGC TCGAACGTTT CGCCGCCGAG CATGACATCA AGATCGGCAC CATCGCCGAT
CTGATTCACT ACCGTATTCA CCACGAGCGC ACCGTCGACG AGGTCGAGCG CAGCGTCGTC
GATACCGCTT TCGGCGAGTT GACCCTGCAC GTCTTCCGCG ACCGCATCCA GAATACGCAT
CATCTGGCGC TGGTGAAGGG CACCCCGCGC ACCGAGTCGC CGACCACCGT ACGTGTGCAC
ATCGCCGATA CGCTGCGCGA CCTGCTGATG CTGACCAGGC CGGATAGCCA CAGCTGGACC
GCTGCCAGTG CCCTGGCCCA GATCGCCGAC GCCGAGGCGG GCGTGTTCGT GCTGCTCGAT
GACGGCCGAC CGCGGCTCGA TCTGAAAGAC CAGCTCGACG TGCTGCTCGG ACGCAAGCCG
GCCCCGCGTT CCAGCGAGTC CGACGGCGCC GGCAATTATC TGACCATCGG CACCGGCTCG
CAGATTCTGC GCCAGCTGGG TGTGGGGCAG ATGCGGCTGT TGAGTTCGCC GTGGAAGTTC
TCCGCGCTTT CCGGTTTCGA CCTCGAGGTC GTCGAACGGG TCGGAGGCGA TACCCCCGAG
AGCGACCAGC CGGTAGAATA G
 
Protein sequence
MALSSKQGLA SIDAIVEDIR QGKMVILMDD EDRENEGDII MAAECVDAEH INFMARYARG 
LICLPMTRER CERLELPLMV RDNGSGFGTK FTVSIEAARG VSTGISASDR ARTVRAAAAR
DAVAADIVQP GHIFPLMAEP GGVLRRAGHT EAACDLAAMA GFEPSGVICE VMNDDGSMAR
RDELERFAAE HDIKIGTIAD LIHYRIHHER TVDEVERSVV DTAFGELTLH VFRDRIQNTH
HLALVKGTPR TESPTTVRVH IADTLRDLLM LTRPDSHSWT AASALAQIAD AEAGVFVLLD
DGRPRLDLKD QLDVLLGRKP APRSSESDGA GNYLTIGTGS QILRQLGVGQ MRLLSSPWKF
SALSGFDLEV VERVGGDTPE SDQPVE