Gene Cag_0436 details

Gene Information

Locus tag: Cag_0436
Symbol:
ID: 3748142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 510426
End bp: 512213
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 47%
IMG OID: 637772969
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_378752
Protein GI: 78188414
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
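
The coordinate and length fields above agree with each other under two assumptions that this record does not state explicitly: the start and end positions are 1-based and inclusive, and the gene length includes a single terminal stop codon. A minimal sketch of that consistency check (the function names are illustrative, not part of any database API):

```python
# Values copied from the record above.
START_BP = 510426
END_BP = 512213
GENE_LENGTH_BP = 1788
PROTEIN_LENGTH_AA = 595


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

If either comparison came out false for another record, the likely causes would be a different coordinate convention, a partial CDS, or a spliced gene.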


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.437679
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGCTA AGGCATCGCT GCACATTGGG TTAGTTTTTA GCGCTCTTTT GTTTGCTTTG 
TTGTGCGGCG GTTGGTGGTT GCTTCATCAC CAATTTAAAA GCGCTCTTAT TACCACAACG
CGAACCGAGT TGCAGCACAA TATGCAGTTA TGCCGTCAAG GGTTAATGGC GCAACCGCTT
ACCTTTTGGC AATCGCCACA AGCGGTATCG CAATGGCTTG GCGAGAGTGC TCGTTTGCTG
AACGTGCGTA TTACGCTTAT TGAAAGCAAT GGCACGGTGG TTGCCGATAC CATGATGCCT
TCTCATAAGT TGCATCAGGC TGAAAATTAC CACATGCGTC CTGAAGTGAA AGCAGCTCTC
AAGCATGGTT TTGGTGAACA TATTCGCTTT AGCTATGCAA CGCAAGAGCA GCAGCTTTAT
ACCACTTTGC CAATGGTGTT CCCTGATGGG CGGCGTATGG TTATTTGCTT TAGTAAGCCG
TTATATGATG TTGGATGGTA TAAAGAGCAT GTGCAGGGCA ATGTGCCCTT GCTCTTTTTG
GGAATGTTTG TGATGTCGCT GGGGGTTGGC ATGGGGAGTG GTTTTTTATT GACTCGCCCT
TTACGCCAAT TAGCTGCGGT AGCACGTCAA CGTCTTCAGG GCGATTTTTC GGCAGCACTG
AGCATAAAGC CTAAACATGA ATTTGGGGAA TTAGCTCATG CGCTTAATAG CATGAGTGAT
AGCGTTATAA CCATGCGCCG CCACGAAGAG TGGTATTTAG CGGTTTTTTC GGCTATTCGT
GAAGCTATTA TTGTAACCGA TGCTGCGGGG GATATTATTT TTGCCAATCC TTCGGCGGCT
CGCACTTTTC GTATGGGGCA AACTATTTTT ACCTCTCGCC CTGTTAAGCA TCTACCCGAT
CCCACGTTGC AAGAGCTTTT TAATCGCGTT CATACCACGC GGGTGATGGT ACGCAAAGAG
GAAGTTGCGC TTTCAACGGC ACGCGGTGAG CGTATTATGC AAATCAATTC CATGCCGCTT
GCTACCATGG GCAAAACCTA TGAAGGGTGC GTTTTTGTAT TGAACGATAT TACCACTGTG
CGCAACCTCG AAAAAATTCG CCGCGATTTT GTTGCCAGTG TTTCCCACGA ATTGCGCACA
CCACTCACGG TTATTAGCGG TTATACGGAA ACGTTGCTGG AGGGCGCTTT GCACGATCCT
GCCCATGCCG TTCCCTTTTT AAAAACCATT TTACAAGCCA GCCAGCAACT TACGGCGTTA
GTGAACGATG TGCTTGATCT TTCGCGCATT GAGTCGGGTG CTATTGATTA CCAATTTACT
TCGGTGGATA TTGGTGGAGT GGTGCGAAAA GCGGTGGAGT TTTTAAAACC ATCGTTGGAG
AAAAAGCAAA TTCGCCTTGA TGTACGTATT ACGGCGGGGC TGCCAACTAT TTATGCCGAT
GCACGTTATC TCGACATTGT GATTCGCAAT TTGGTAGATA ATGCCATTAA CGCCGTTGAT
GAGCGTAATG GGCGTATTCG TATTTCGGCA TTTGCTATGA ATAAGGAAGT GGTGCGCCTT
GAGGTTGAAG ATAACGGGGT GGGCATTGCT AAAGCTGATC TTGATCGCAT TTTTGAGCGC
TTTTATCGGG TTGATAAAGG GCGTTCACGC CAATATGGTG GCACAGGGTT GGGGCTTTCA
ATTGTTAAAC ATATTGTGTT AGCACATCAA GGCGATATTG TTGTAAACTC AAAGCTTAAC
CATGGTTCTG TTTTTAGTGT TCTTTTAAAA GTGGCTCATA GCAAGTAG
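
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the record. It assumes GC content means the share of G and C among all bases; how the database rounds its figure is not stated here. The helper names are illustrative.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First displayed line of the gene sequence only, to keep the example short.
    first_line = (
        "ATGCGAGCTA AGGCATCGCT GCACATTGGG TTAGTTTTTA GCGCTCTTTT GTTTGCTTTG"
    )
    seq = clean_sequence(first_line)
    print(len(seq), gc_percent(seq))
```

To check the value in the record, pass the full 1788-base sequence instead of the first line; a single line need not match the gene-wide figure.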
 
Protein sequence
MRAKASLHIG LVFSALLFAL LCGGWWLLHH QFKSALITTT RTELQHNMQL CRQGLMAQPL 
TFWQSPQAVS QWLGESARLL NVRITLIESN GTVVADTMMP SHKLHQAENY HMRPEVKAAL
KHGFGEHIRF SYATQEQQLY TTLPMVFPDG RRMVICFSKP LYDVGWYKEH VQGNVPLLFL
GMFVMSLGVG MGSGFLLTRP LRQLAAVARQ RLQGDFSAAL SIKPKHEFGE LAHALNSMSD
SVITMRRHEE WYLAVFSAIR EAIIVTDAAG DIIFANPSAA RTFRMGQTIF TSRPVKHLPD
PTLQELFNRV HTTRVMVRKE EVALSTARGE RIMQINSMPL ATMGKTYEGC VFVLNDITTV
RNLEKIRRDF VASVSHELRT PLTVISGYTE TLLEGALHDP AHAVPFLKTI LQASQQLTAL
VNDVLDLSRI ESGAIDYQFT SVDIGGVVRK AVEFLKPSLE KKQIRLDVRI TAGLPTIYAD
ARYLDIVIRN LVDNAINAVD ERNGRIRISA FAMNKEVVRL EVEDNGVGIA KADLDRIFER
FYRVDKGRSR QYGGTGLGLS IVKHIVLAHQ GDIVVNSKLN HGSVFSVLLK VAHSK
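
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the conventional TCAG ordering; for internal codons, translation table 11 uses the same amino acid assignments as the standard code. It does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here), and the function name is illustrative.

```python
from itertools import product

# Codons in TCAG order with their amino acids; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    print(translate("ATGCGAGCTA AGGCATCGCT GCACATTGGG"))
```

The first 30 bases translate to the first 10 residues of the protein sequence listed above; feeding in the whole gene sequence should give the full protein.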