Gene Csal_2289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2289 
Symbol 
ID4026442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2577253 
End bp2578845 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content66% 
IMG OID637967493 
ProductIMP cyclohydrolase / phosphoribosylaminoimidazolecarboxamide formyltransferase 
Protein accessionYP_574338 
Protein GI92114410 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAAC ACACTACCCC GTCGTCCGCC CGCCCCGTGC GCCGCGCATT GATCAGCGTG 
TCCGACAAGA CCGGCATCGT GGAGTTCGCC CGCGAGCTGG CCGCGCGTGG CGTCGCGCTG
CTGTCCACCG GCGGCACGTT CCGCCTGCTC AGCGAGCACG GTATCCCGGT GACGGAAGTC
TCCCAACACA CCGGTTTCCC GGAAATCATG GACGGCCGCG TCAAGACCCT GCACCCCAAG
ATTCACGGCG GCATTCTCGG TCGTCGCGGG CAGGACGACG AGGTGATGGC CGCGCATGAC
ATCGATGCCA TCGACATGGT CGTGGTCAAC CTCTACCCCT TCGCCGACAC TGTGGCCCGC
GAGGATTGCA CGCTGGAAGA GGCCATCGAG AACATCGACA TCGGCGGCCC GACCATGGTG
CGCGCCTGTG CCAAGAACCA CGCCCACACC ACGATCGTGG TCAACGCCGG CGACTACTCG
CGCGTACTGG ATGACATGAG CGCCCAGGGC GGCGCCGTCG GCCAGGCGCT GCGCTTCGAC
CTGGCGGTCA AGGCCTTCGA GCACACCGCC GGTTACGACG GCGCCATCGC CAACTATCTC
GGTACCCTGG CGGAAGGCGG TGAAGCCAAC TTCCCGCGCA CCTACAACGT GCAGTTCCAC
AAGAAGCAGG CCATGCGCTA CGGCGAGAAC CCGCACCAGC AGGCTGCGTT CTATGCCGAA
GCCGATGCCG CGGAAGCCAG CGTAACCACT GCCCGGCAAC GCCAGGGCAA GGCGCTTTCC
TTCAACAACG TCGCCGATAC CGATGCTGCC TTCGAGTGCG TCAAGGCCTT CCGCGAGACG
GCCTGCGTCA TCGTCAAGCA CGCCAACCCC TGTGGCGTCG CCGTCGCCGA GACCCCGCTC
GCGGCCTACG AGCGCGCTTT CGCCACCGAT CCCACCAGTG CGTTCGGCGG CATCATCGCC
TTCAATCGTG CGCTCGATGC CGCCACGGCT CGCGCCATCG TCGATCGGCA GTTCGTCGAG
GTCATCATCG CTCCGGGCAT CAGCGACGAG GCTGCCGACA TCGTCGCCGA GAAGAAGAAT
GTCCGCCTGC TCGATGTCAG CGACCATTGG CCCGGGCAGC GTCGCCCCTC GCACGACTTC
AAGCGCGTCA CCGGCGGCTT GCTGGTGCAG GACCGTGACC TGGGCATGGT CGAGCGGGAG
GAACTGCGCA CCGTCACCGA GCGCGCGCCC AGCGAGCAGG AAATGAACGA CCTCAGCTTC
GCCTGGAAAG TCGCCAAGTA CGTCAAGTCC AATGCCATCG TCTACGCCAA GCAAGGGCAG
ACCATCGGGG TCGGTGCCGG TCAGATGAGC CGCGTCTATT CCGCCAAGAT CGCGGGCATC
AAGGCCGCCG ACGAGCATCT GGAAGTGCCC GGCTCGGTGA TGGCCTCGGA TGCCTTCTTC
CCGTTCCGCG ACGGCATCGA TGCCGCGGCC CAGGCCGGCA TCACCGCCGT GATCCAGCCC
GGCGGCTCCA TGCGCGATCA GGAAGTGATC GATGCCGCCA ACGAAGCCGG CATCGCCATG
GTCTTCACCG GCATGCGTCA CTTCCGGCAC TGA
 
Protein sequence
MAEHTTPSSA RPVRRALISV SDKTGIVEFA RELAARGVAL LSTGGTFRLL SEHGIPVTEV 
SQHTGFPEIM DGRVKTLHPK IHGGILGRRG QDDEVMAAHD IDAIDMVVVN LYPFADTVAR
EDCTLEEAIE NIDIGGPTMV RACAKNHAHT TIVVNAGDYS RVLDDMSAQG GAVGQALRFD
LAVKAFEHTA GYDGAIANYL GTLAEGGEAN FPRTYNVQFH KKQAMRYGEN PHQQAAFYAE
ADAAEASVTT ARQRQGKALS FNNVADTDAA FECVKAFRET ACVIVKHANP CGVAVAETPL
AAYERAFATD PTSAFGGIIA FNRALDAATA RAIVDRQFVE VIIAPGISDE AADIVAEKKN
VRLLDVSDHW PGQRRPSHDF KRVTGGLLVQ DRDLGMVERE ELRTVTERAP SEQEMNDLSF
AWKVAKYVKS NAIVYAKQGQ TIGVGAGQMS RVYSAKIAGI KAADEHLEVP GSVMASDAFF
PFRDGIDAAA QAGITAVIQP GGSMRDQEVI DAANEAGIAM VFTGMRHFRH