Gene Clim_0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0785 
Symbol 
ID6353855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp859087 
End bp860325 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content51% 
IMG OID642668409 
Productputative transcriptional regulator 
Protein accessionYP_001942844 
Protein GI189346315 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTTA GAAATCTGAC GGAAGAACTT TTAGGAAAAG GTGAGTCAGA TCGCATTGAG 
TTTATTGCAT CAGCCCGGGC AGAAAACTCA ATTGGTCGTG CCGTGTGTGC ACTTCTCAAT
ACCAAAGGCG GCAGCGTTTT AGTCGGTGTC GATGATTGCG GGCAGGTGCT CGGTGTTCTC
AGAGAAGAGG ATGCTGATGC ACTCCGCTTA TTTTTGCATA GACACATCAC CCCTCAGGTA
TTGTTCACTG TTACTCTGGA TGATGTTCAG GGAGGCAGGG TCATTACTGT TGATATACCG
GAAGGCTCTG ACCGACCCTA TGTTTTTGAT GGGGCGGTTT ACATCAAGAA AGGGCTGGAT
ATCCTGGCGG TCGACGCTGC GACAATGCGC GAGATGGTGG TCCGGCAATC CCGCGAAACC
GAGAGATGGG AACGTCGCGT CGCTGTCGGT CTTGCCATTG ACGATCTCGA TCGCAAGCTG
CTGGATGAGA CTGTACGCAA GGCGCAGGAT CGAGGGTATC GGTTTGAAGA GGTTCACAAG
CCTGATGCCG TGCTTGCGGA TTTGGCTTTG GCTCGGTTCG GTCAATTGAC CAATGCGGCA
GATGTGTTGT TTGGTAAACG TGTTGCACTG CGCCATCCGC AGACGCGACT GCGGGCGGTT
TGCTATGAAA CGGATCGCGG AGACAATTTT ATCGATGAAC AGTTGTACGA AGGTCCGGCA
TTCTATCTGC TGGAAGAAGC GATGGTCTTT CTAAAAAGGC ATGTTGCGAT TGCTGCCGAA
TTCAAGCCTG GACAACTGGC AAGGGAATCT CGCCCGCAGT ATCCATTCAA CTCATTGCGG
GAGGGGTTGG TCAATGCGCT GGTTCATCGC GATTATGCAG CATTCTCCGG CGGCGTTTCG
GTTAGTATTT ACCCCGGACG TATTGAAATC TGGAATTCAG GACATCTTTC TATGGGGCTG
ACTCCGGAAA AACTTCGGTC GGCGACTCAT GAATCCATTC TTGTCAACCC GGATATCAGC
CATGTTTTCT ATCTGCATGA ATTGATGGAG CGGGTCGGAC GTGGTACGTT CAAAATTGTC
CAGGAATGCC GGGATATGCG GATGCGTCCG CCAGTGTGGC AGAACAAGGT ATCCGGTGTA
CATCTGACAT TTTTCGGGGT TGGGCAAGGA CAAATTTCTG TAAAGATCAA CGAACGACAA
CGAGCGCTGC TTGATGGTCT AGCAGCCTGT CGGAATTGA
 
Protein sequence
MNVRNLTEEL LGKGESDRIE FIASARAENS IGRAVCALLN TKGGSVLVGV DDCGQVLGVL 
REEDADALRL FLHRHITPQV LFTVTLDDVQ GGRVITVDIP EGSDRPYVFD GAVYIKKGLD
ILAVDAATMR EMVVRQSRET ERWERRVAVG LAIDDLDRKL LDETVRKAQD RGYRFEEVHK
PDAVLADLAL ARFGQLTNAA DVLFGKRVAL RHPQTRLRAV CYETDRGDNF IDEQLYEGPA
FYLLEEAMVF LKRHVAIAAE FKPGQLARES RPQYPFNSLR EGLVNALVHR DYAAFSGGVS
VSIYPGRIEI WNSGHLSMGL TPEKLRSATH ESILVNPDIS HVFYLHELME RVGRGTFKIV
QECRDMRMRP PVWQNKVSGV HLTFFGVGQG QISVKINERQ RALLDGLAAC RN