Gene Clim_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0073 
Symbol 
ID6355596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp78710 
End bp79903 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content60% 
IMG OID642667696 
Productputative transcriptional regulator, GntR family 
Protein accessionYP_001942158 
Protein GI189345629 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAGAT TTTCGCAATC AGTATCAGCG CTTCGCTCCT CGGCAATCAG GGAGCTTATG 
AGCCTCGCAT CAAGGCCCGA CATCATCTCC TTTGCCGGCG GCATGCCGGG CAACGATCTC
TTTCCGGTCG AAGAGGTCGA GGAGCTGTTC CAGAACCTCG ACCCAAAAAC CAAACAGACG
GCATTCCAGT ACGGCCCGAC CCCCGGCCTG CCGTCGCTGC TCGAATCGCT CTCCGGCTAC
CTCGAACGAA AAGGGCTGCC CGTACAGAAA AACCGGCTCA TGATCACCAC CGGCTCCCAG
CAGGCGCTCA GCATCCTCGC ACGGGCATTC ATCGACCCCG GCGACCAGGT GCTCAGCGAG
TACCCCTGCT TCATCGGAGC GATAGCGGCC TTCAAGGCAT GCGGAGCCGA TATCGTCTCC
ATTCCGGTCG ATGAGGAAGG CATCGACATC GGCATGCTGC GGCATGAAGC AGGACGCCCT
TCGCCCGCAA AATTACTCTA CCTAACGCCC TACTTCCACA ACCCGGCAGG GATGCTCTAT
ACAACCCGTC GCAAACGCCA GCTCATCGAG GTCATGCAGG GACGCGACAT CCCCATCATC
GAAGACGACG CCTACGGCGA CCTCTGGTTC AGCGAAGAAG ATCGCGAACG GCTGCAGCCC
CTCAAATCGA TCGACCCCGA AGGCATCGAC CTCTGCTATA CCGGATCGTT CTCCAAAATC
CTCGGCCCCG GCCTCCGTCT CGGCTGGCTG CTCGCCCCCG AAGCCATCCA CGAAAAATGC
GAACTGATCA AGCAGTCCGC CGACGCCTGC TCGCCGAGCT TCACCCAGGT CATCGCCGAC
GCCTTCATCC GCTCGGGCAG AATAGACAGC TACATAGCCT CCGTACGCAA CGAGTACCGC
TGCCGGGCGG CCTGCATGAC CGCAGCGCTC GGAAGCCTTC TGCCGGACTA TGTGCAATGG
AACGAACCGA AAGGAGGATT CTACATCTGG CTCACCCTTC CCGAAGGAGC GGACGCCACG
GAAATTCTCA AACACGCCAT CGAAGGCGGA GCCGTCTTCG TCGCCGGCAG CACTTTCGAC
CCCGAAGGCC GACGCAACAA CGCCATCAGG CTCTCCTACT GCAACAACAC CCCGGAAGAG
ATCGAGCGGG GCATTCCGAT CGTTGCAAGG GCGATCAGGG AAGTTTGCGG ATGA
 
Protein sequence
MPRFSQSVSA LRSSAIRELM SLASRPDIIS FAGGMPGNDL FPVEEVEELF QNLDPKTKQT 
AFQYGPTPGL PSLLESLSGY LERKGLPVQK NRLMITTGSQ QALSILARAF IDPGDQVLSE
YPCFIGAIAA FKACGADIVS IPVDEEGIDI GMLRHEAGRP SPAKLLYLTP YFHNPAGMLY
TTRRKRQLIE VMQGRDIPII EDDAYGDLWF SEEDRERLQP LKSIDPEGID LCYTGSFSKI
LGPGLRLGWL LAPEAIHEKC ELIKQSADAC SPSFTQVIAD AFIRSGRIDS YIASVRNEYR
CRAACMTAAL GSLLPDYVQW NEPKGGFYIW LTLPEGADAT EILKHAIEGG AVFVAGSTFD
PEGRRNNAIR LSYCNNTPEE IERGIPIVAR AIREVCG