Gene Cpha266_1129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1129 
Symbol 
ID4570336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1279185 
End bp1280612 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content49% 
IMG OID639765725 
Productputative transcriptional regulator 
Protein accessionYP_911593 
Protein GI119356949 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAAA ATAACCGCAT CGAATACAAG CGCGAGCTGA CTGACAGGCT TGAGAAAGAG 
GTCATCGCAT TTCTGAACTA TCATGATGGT GGCATTATCT ATATCGGTAT TGACAAGTAT
GGAGCGGTCT ACGGCGTTTC GGAGTGCGAC GCTCTGCAGC TTGTCATCAT GGATCGTCTC
AAAAATAATA TTCAGCCCTC ATGCCTTGGT TTGTTCGATG TCATTCACGA AACCCGTGAA
GGCAAAGATA TTATTAAAAT CATCGTTGCC AGTGGTACCG AAAAGCCCTA TTACCTGCGC
AGGTTCGGTA TGTCGGAAAA AGGATGTTTC ATTCGCATTG GCAGCGCCAG TGAGCCCATG
ACGCAGCGCA TGATTGAAGA GCTGTTTGCC CGGCGTACGC GTAATTCAAT TGCCCGCATC
CGTTCTCTGC GGCAGGATCT CACCTTTGAA CAGCTTAAAA TCTACTATCA GGAGGCAGGT
CTTACGCTTG GCGACAAATT CGCCGCGAAT CTTGAACTGC TCACCGAGGA CGGAGTTTAC
AATTATGCAG GCTACCTGCT GGCAGACCAG AATGGAAACT CTGTGCAGGT GGCAAAATAC
GCAGGCACTG ACAGGGTTGA TCTGCTTGAA AGCAAGGACT ACGGATTCTG TTGCCTGGTT
AAAACCTGCA AGCAGATACT CGATCGTCTG GAGGGAGTCG AGAACCGTGT CATCAATAAA
ATAACTCCCC GTGAGCGTAT CAGCCGAAGC CTGTGGGATA AGGTCGCCCT TCGTGAAGCC
GTTATCAACG CCATCATTCA TAACGACTAC AGCACAGAAC TGGTGCCAAA GTTTGAAATT
TTTGCTGACC GGCTGGAAAT TACCTCGGCA GGTACCGTTC ATCCCGGATC GGAGCAGGAT
GATTTTTTTG CCGGATATTC CATGCCTCGC AATAAAACTC TGATGCGGGT CTTCAAGGAT
CTTGCTATGG TGGAATACCT CGGCTCCGGA ATGCCCCGGA TTCTCAAAGC TTATCCTCGT
GAAGCCTACA CCTTCTCCAC CCATTTCATC CGGACGACCT TTCCTGTATC ACCGGAAGCT
CTTGCGCTGG AAAAAGATGT GATGAAAGCC GAAGCAAAAA CCTCGGGGGC CCAGTCAAAG
GGCCTCGTTA CCGGACAAGT CACCGGACAA GTCACCGGAC AAGTCACCGG ACAAGTCGAG
AGATGGATGT ATGATGTACT GACCGCCTGC ATTGATCCGA AAAAAAGTAT GGAAATTCAG
GAAATCATCG GGATCAAGCA CCGCGAAACA TTTCAACGGA ACTATCTTGA CCTTTTACTC
GAACAGGGTT TACTTGTTCG CACCATTCCC GACAAACCTC AAAGTCGCTT TCAACGCTAC
AAAACAACCG CGCAAGGTCA AACATTTTTG AAGCAGAACA GACAATGA
 
Protein sequence
MTENNRIEYK RELTDRLEKE VIAFLNYHDG GIIYIGIDKY GAVYGVSECD ALQLVIMDRL 
KNNIQPSCLG LFDVIHETRE GKDIIKIIVA SGTEKPYYLR RFGMSEKGCF IRIGSASEPM
TQRMIEELFA RRTRNSIARI RSLRQDLTFE QLKIYYQEAG LTLGDKFAAN LELLTEDGVY
NYAGYLLADQ NGNSVQVAKY AGTDRVDLLE SKDYGFCCLV KTCKQILDRL EGVENRVINK
ITPRERISRS LWDKVALREA VINAIIHNDY STELVPKFEI FADRLEITSA GTVHPGSEQD
DFFAGYSMPR NKTLMRVFKD LAMVEYLGSG MPRILKAYPR EAYTFSTHFI RTTFPVSPEA
LALEKDVMKA EAKTSGAQSK GLVTGQVTGQ VTGQVTGQVE RWMYDVLTAC IDPKKSMEIQ
EIIGIKHRET FQRNYLDLLL EQGLLVRTIP DKPQSRFQRY KTTAQGQTFL KQNRQ