Gene Csal_0252 details


Gene Information

Locus tag: Csal_0252
Symbol:
ID: 4029197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 286182
End bp: 287519
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 67%
IMG OID: 637965403
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_572315
Protein GI: 92112387
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
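
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which agrees with the stated gene length) and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, for an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(286182, 287519)
print(gene_len)                           # 1338
print(expected_protein_length(gene_len))  # 445
```

Both values match the Gene Length and Protein Length fields in the record.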


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCCA TGCAGTGGGA GCGTCTACTC GACCCGACCC GCCTCAACGA CAAGCACGGA 
GGTTCGCGCG AGGAGATCGG TCGCAGTCCG TTCCACAAGG ATCACGACCG CATCGTCTTC
GCCGGCTCGT TCCGGCGCCT GGGACGCAAG ACACAGGTCC ACCCGCTGAC CGACAACGAC
CATATCCACA CGCGCCTGAC GCATTCGCTG GAAGTGGGCT GCGTGGGACG TTCGCTGGGC
ATGATCGTCG GCGAACAGCT GCGCGAGCGG TTGCCGGCCT GGATCACGCC GGCCGACCTG
GGGGTGATCG TGCAGGCGGC TTGCCTGGGG CACGATATCG GCAATCCGCC TTTCGGGCAC
GCCGGGGAGT ACGCGATCCG CGACTGGTTC AAGCGTGCCG AGGCGGATGG CAGTGGGCTG
TTGGCCGATC TGGCGCCGGC CGAGCGCGAG GACCTGCTGA CCTACGAGGG CAACGCCCAG
GGGTTCCGCA TCGTCACCCA GATCGAGTAC AACCAGTTCA ACGGCGGCAT GCGTCTGACC
GCCGCCACGC TGGGCACGAT GCTCAAGTAT CCGTGGACCG TGGAGCGCGC CTCGCGGGCC
GGCAAGTTCG GCTGTTATCA ATCGGAACGC GAGCTGCTGG GCGAGGTCGC CGAGCGCCTG
GGACTGTTGC CGCGCGGCGA GACCGCCTGG TGCCGGCACC CGCTGGCCTA TCTCGTCGAG
GCCGCCGACG ATATCTGCTA TGCGCTGCTG GATCTCGAGG ATGGCCTGGA AATGGGCATC
CTGCGCTTCG ACGAAGTGGC GGAGATCCTG ATCCAGATCG CCGGGGGCGC GCCCGAGGGC
TACGACGGCA TGCGCGCGCG GGGCGTATCG CAGCGCCGAC GCATCGCGAC GTTGCGCGGG
GCGGCCATGG AGCGCGCGGT GAACGATGTC GCTGCGGTGT TCGTGGCGCA TGAAAGCGCG
CTGCTCGGCG GTACGCTGAG CGAGGATCTG CTCGAGCTCT GCCACCCCGA CCTGGGCTGG
GGCGTGGGCA CGGCCAAGCG CATCGCCCGC GAGCGCATCT TCCAGAACGA GCGCAAGGCC
AAGCTGGAGA TCGGCGCCTA CACGACCCTG GGAATCCTGC TCGAAGCCTT CATCGGTGCC
GCCCATGAGT TGCACTACAG CGGCCAGTCA TCCTTCAAGC ATCAGCGAGT GCTGGCGTTG
ATCGGCGAGA ACACGCCCAA GTCGTCCTGG TCGCTGTATG CCAGCTACCG GCGCATGCTC
GATTTCATCG GCGGCATGAC CGACCACTAC GCCGTCGATC TGGCCCAGGA GATGGGCGGC
CGCCTGCGCG GCGAATGA
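
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any length or GC calculation. A minimal sketch follows, using only the final line of the sequence as input; the helper names are hypothetical, and the full sequence would need to be passed in to compare against the GC content reported in the record.

```python
def clean_sequence(formatted: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(formatted.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


last_line = "CGCCTGCGCG GCGAATGA"
seq = clean_sequence(last_line)
print(len(seq))                    # 18
print(round(gc_fraction(seq), 3))  # 0.722 (13 of 18 bases)
```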
 
Protein sequence
MSAMQWERLL DPTRLNDKHG GSREEIGRSP FHKDHDRIVF AGSFRRLGRK TQVHPLTDND 
HIHTRLTHSL EVGCVGRSLG MIVGEQLRER LPAWITPADL GVIVQAACLG HDIGNPPFGH
AGEYAIRDWF KRAEADGSGL LADLAPAERE DLLTYEGNAQ GFRIVTQIEY NQFNGGMRLT
AATLGTMLKY PWTVERASRA GKFGCYQSER ELLGEVAERL GLLPRGETAW CRHPLAYLVE
AADDICYALL DLEDGLEMGI LRFDEVAEIL IQIAGGAPEG YDGMRARGVS QRRRIATLRG
AAMERAVNDV AAVFVAHESA LLGGTLSEDL LELCHPDLGW GVGTAKRIAR ERIFQNERKA
KLEIGAYTTL GILLEAFIGA AHELHYSGQS SFKHQRVLAL IGENTPKSSW SLYASYRRML
DFIGGMTDHY AVDLAQEMGG RLRGE
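
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is an illustration, not the database's own method: it builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). It stops at the first stop codon and does not handle ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAGTGCCA TGCAGTGGGA G"))  # MSAMQWE
print(translate("CGCCTGCGCG GCGAATGA"))      # RLRGE
```

The two inputs are the first 21 bases and the final line of the gene sequence; the outputs match the first seven and last five residues of the protein sequence.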