Gene Csal_1101 details

Gene Information

Locus tag: Csal_1101
Symbol:
ID: 4029038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1250412
End bp: 1251806
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 69%
IMG OID: 637966278
Product: GntR family transcriptional regulator
Protein accession: YP_573156
Protein GI: 92113228
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
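
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are hypothetical and exist only for this illustration.

```python
# Values copied from the record above.
START_BP = 1250412
END_BP = 1251806
GENE_LENGTH_BP = 1395
PROTEIN_LENGTH_AA = 464


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS, minus one for the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 1251806 - 1250412 + 1 = 1395 bp, and 1395 / 3 - 1 = 464 aa, which agrees with the reported lengths.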


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAATCT GGACACCCGA ACTGCCCGAG GGCGGTCCAC GCTACCGGCG CATCGCCGAG 
GCCATCGCCA GGGCCGTCGA AAGCGGCGAT CTGGCCCCCG GCGACAAGCT ACCGCCCCAA
CGCCGACTGG CCGATCGTCT GGGCGTCACC ATCGGGACCA TCACCCGCGC CTACGCCACG
GCCCACCGCC AGGGCTGGGT CGATTCGCGG GTGGGCAGCG GTACCTATGT CCGCCAGCCC
CTCGGGGAAG CGCCCACCAG TTTTCGTGCC GGGCAACCTG TCGACCAGGG CATGATCGAC
ATGGGAATGA GCTTGCCGCC GCCGCATCCG TTGCGCAGTC AGGGACTTCA AAACGCCCTC
GAGAGTCTCA CTGCCTCGCC GCAGTCCCTT CAGGCGGCCA CCGAGTACCA GCCGGAGGCC
GGCGACGAGG GGCAGCGGGC GCGCATCGCG GCCTGGCAAG GAAGCCTCGG CCTGCCCGAT
GACCCGAGTC GGCTGGTCAT CACGCAAGGC GGCCAGCACG GCATCCAGCT CGCCCTGCAA
GTCCTCACCC AGCCGGGTGA TCTGGTGGCG GGCGACGCGC TGACGTACCC GGGCTTCATC
TCCGCCGCCC AGCAGGCCCA CCTGCGGCCG ATGGCAGTGC CCATGGACGA GGGGGGCATG
GACCTCGATG CCCTGACCCG GTTGTGCGCT CGCCAACCGC CGCGCGCGCT CTACACCACG
CCCGATCTCA ACAATCCCAC CGGTGCACGA CTCGACGAAG CACATCGCAG GCGACTGGTG
GCCCTGGCGC GCGAGCACGA CATCTGGCTG ATCGAGGACG CCGTGCAGTA CCTCCCGCAG
GCCGAGCGCG GCACGCCACT GGTCGAGCTG GCGCCAGAGC GCACCCTGCA CATCTTCAGT
ACCTCCAAGG TGCTGGCCGG CGGACTGCGC ATTGGCAGCC TCACGGTGCC GGCGCATCTG
CTGGAGCGGC TGGGTACGGC GATTCGCACC CAAAGCTGGA TGGTGGCGCC GCTGATGGTC
GAGACGGCCT GCCGCTGGAT GGAGAACCCC GCCAGCCAGG AACTGCTCCA CTGGCAGATC
GACGAGCTTG CCGCTCGTCG CACGCTGGCG CTGGACCGCC TGGCAGTACA TGGCGCGCGG
AGCCGTCAGG GCTCGTCGCT GGTCTGGCTG CCGCTACCGG CAGGACGACG CAGCAGCGAG
CTGCAGACGC TGCTTGCCAG GCGCGGCGTC AAGGTGTCCA CGCCGGAGCC CTTCTGCATG
GGCAGCGAAC CGGCCCCCCA GGCACTGCGA TTGTGTCTGG GGCCTCCTGC GAATCGCGAC
TCACTGGAAA AGGCGCTGTC GCTGATCGAC GAGGCACTCT CGGAAGCGCC ATCGTCGCCC
TGGCAGACGC TGTAA
 
Protein sequence
MTIWTPELPE GGPRYRRIAE AIARAVESGD LAPGDKLPPQ RRLADRLGVT IGTITRAYAT 
AHRQGWVDSR VGSGTYVRQP LGEAPTSFRA GQPVDQGMID MGMSLPPPHP LRSQGLQNAL
ESLTASPQSL QAATEYQPEA GDEGQRARIA AWQGSLGLPD DPSRLVITQG GQHGIQLALQ
VLTQPGDLVA GDALTYPGFI SAAQQAHLRP MAVPMDEGGM DLDALTRLCA RQPPRALYTT
PDLNNPTGAR LDEAHRRRLV ALAREHDIWL IEDAVQYLPQ AERGTPLVEL APERTLHIFS
TSKVLAGGLR IGSLTVPAHL LERLGTAIRT QSWMVAPLMV ETACRWMENP ASQELLHWQI
DELAARRTLA LDRLAVHGAR SRQGSSLVWL PLPAGRRSSE LQTLLARRGV KVSTPEPFCM
GSEPAPQALR LCLGPPANRD SLEKALSLID EALSEAPSSP WQTL
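
The sequences above are laid out in blocks of ten characters separated by spaces. As a minimal sketch of how the reported GC content could be recomputed from the gene sequence, the helpers below (hypothetical names, not part of any database tool) strip the block formatting and return the G+C fraction. The record does not say how its 69% figure was rounded, so no exact match is asserted here.

```python
def clean_sequence(block_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(sequence):
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Toy input in the same block layout as the record (not the real gene).
toy = "ATGC GGCC"
print(gc_fraction(clean_sequence(toy)))
```

To apply this to the record, paste the full gene sequence text into `clean_sequence` and pass the result to `gc_fraction`; the cleaned string should be 1395 characters long if the copy is complete.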