Gene Cpha266_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1972 
Symbol 
ID4570412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2286508 
End bp2287701 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content52% 
IMG OID639766553 
ProductGntR family transcriptional regulator 
Protein accessionYP_912409 
Protein GI119357765 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGTT TCTCAAAAGC CGTATCGCTG CTCCGGTCAT CTGAAATCAG GGATCTCATG 
ACCCTTGCCT CAAGGCCTGA CATTATCTCC TTTGCCGGAG GCATGCCTGG CAATGATCTC
TTTCCCGTGC AGGAAATAGA AGAGCTCTTC AGCAACCTCG ACGAGAAAAC AAAACAGGCA
GCATTTCAGT ATGGGCCTAC ACCGGGTCTT CCCTCTCTGC TTGAATCTCT TTCAGGATTT
CTTGAGCGCA AAGGACTGCC GGTCAAAAGC AACCGGCTGC TCATCACCAC CGGTTCCCAG
CAGGCACTCA GTCTGCTTGC CAAAACCTTT ATAGATCCGG GGGACCGGGT GCTGGTTGAA
CAGCCCTGTT TTATTGGAGC TCTTTCAGCA TTCCGTTCCT CTGAAGCCGC GCTGCATGGC
ATCCCTGTTG ACAGGGAGGG ACTGGTCATC GATCTGCTCA ACGAAGAAAT CAGAAAAAAA
GAGAGAGCCA GGCTGCTCTA TATCACCCCC TATTTCCATA ATCCGGCAGG CCTGCTCTAC
AGCAAGGAAC GTAAAGCAGA ACTTATCAGA ACACTGCAAG GTTCAAACAT CCCGCTCATC
GAGGATGATG CCTACGGCGA CCTGTATTTC CATGAAGAGG ATCGGGAACG GTTACAACCA
ATCAAATCCA TCGATCCGGA AGACATTGAT GTCTGTTATA CCGGTTCCTT CTCGAAAATT
CTCGGTCCCG GCCTCAGGCT CGGATGGATG CTCGTCCCTG AAGCGATCCA TGAAAAATGC
GAGCTGATCA AACAGTCGGC CGACGCCTGT TCTCCAAGTT TCACCCAGGT GCTCGCTGAC
GCCTTCATCC GTTCCGGCAA GATCGACAGC TATATTGCCG GTGTACGTCA GGAATATAAA
AAAAGAGCTT CGGCCATGGT TGCCGCTCTG AAGGAGCATC TCCCATCATA CGTTCACTAC
AACGAACCAA GGGGCGGATT CTACATCTGG CTGACGCTGC CGGAAGGGAG CGACGCAACG
GAGATCATGA AAATCGCCGT CAAAGGCGGG GCGGTCTTCG TTGCAGGAAA AACCTTTGAT
CCTGAAGGAA AAAAAAACAA CACGCTTCGC CTCTCCTACT GCAACAACAC CCCGGAGCAG
ATCGCCGAAG GAATTCCGAT CATCGCGGCT GCAATCAGGC TGCTCTGCGG TTGA
 
Protein sequence
MQRFSKAVSL LRSSEIRDLM TLASRPDIIS FAGGMPGNDL FPVQEIEELF SNLDEKTKQA 
AFQYGPTPGL PSLLESLSGF LERKGLPVKS NRLLITTGSQ QALSLLAKTF IDPGDRVLVE
QPCFIGALSA FRSSEAALHG IPVDREGLVI DLLNEEIRKK ERARLLYITP YFHNPAGLLY
SKERKAELIR TLQGSNIPLI EDDAYGDLYF HEEDRERLQP IKSIDPEDID VCYTGSFSKI
LGPGLRLGWM LVPEAIHEKC ELIKQSADAC SPSFTQVLAD AFIRSGKIDS YIAGVRQEYK
KRASAMVAAL KEHLPSYVHY NEPRGGFYIW LTLPEGSDAT EIMKIAVKGG AVFVAGKTFD
PEGKKNNTLR LSYCNNTPEQ IAEGIPIIAA AIRLLCG