Gene Cagg_1364 details


Gene Information

Locus tag: Cagg_1364
Symbol:
ID: 7268656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1688175
End bp: 1689323
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 56%
IMG OID: 643566207
Product: signal transduction histidine kinase
Protein accession: YP_002462707
Protein GI: 219848274
COG category: [T] Signal transduction mechanisms
COG ID: [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
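
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the reported gene length includes one terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1688175
end_bp = 1689323
gene_length_bp = 1149
protein_length_aa = 382

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

coords_consistent = span_bp == gene_length_bp
frame_consistent = gene_length_bp % 3 == 0
protein_consistent = expected_protein_aa == protein_length_aa
print(coords_consistent, frame_consistent, protein_consistent)
```

Under these assumptions the three checks agree with the reported values: the span is 1149 bp, which is 383 codons, giving 382 residues once the stop codon is excluded.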


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00521592
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGAAATG AGGCAACGTT AAGCCCAGCC GAAACAGCGC GCATACAGGC CCGTTTGGCC 
GAGCTTGAAG AATTGGTACG CGCCTTGCAG ACGGGAGCAG CCGACGCAAT TGTCATCGAT
GGGCCGCGCG GCCCTCTCAT CTACACCCTC CGCGGGGCTG AACATCCCTA CCGAGTCTTG
GTCGAAACGA TGAACGAAGG CGCATTGACA TTGCTCGCCG ACGGCGCCAT TCTCTACTGT
AACAGTAAAT TTGCCGAGAT GGTCGGTCTG CCCCAAGATC AACTGACCGG CCGCTCACTG
CTCGACCTTG TCGCCCCGGC CGACCGATTG CTCTGCGCCG AGTTGTTGTC GGCGGGGGCT
GCCGGTTCGA GTAAAGGGCC GATTACCCTC CAAGCGGCTG ACGGTTCACA ACGACCGGCC
CAGATTTCGT TGCGTGCCCT CAAAGATGAG ACGGAAGCGC ATATGTGCGC CGTAGTAACC
GATCTCTCCG GCCCGCAAGC AGTCGCAGCC CAACTCCGCG CTGCGCTGGC CGAAAAAGAA
CTGCTGCTGC GCGAAGTCCA TCATCGGGTC AAAAACAATT TACAGATCGT TTCTAGTCTC
TTGCGTCTGC AAGCCGAACA TATTAGTGAT GAACGGGTAA GCGCTGCAGT TCACGATAGT
CAAAATCGTA TCCGCGCACT TGCGCTCGTC CACGAACAGT TGTATCGCTC CGAAAGCCTG
GCGCACATCG ATATCGGCGA ATACTTGCAA AACATTGCCA CCAGTGTTTG GCGCTCGCTG
AGTATACGCG GTAGCCCGAT CCGGTTAGTC AGCGAGGTGA TCAACGGGAT CAGTATCAAT
ATCGATCAGG CAATCGCGCT TGGGTTGATC GTGACTGAAC TAGTCTCGAA TAGTGTCAAA
CACGCCTTCC CAACCGGCAC CGCTCAAGGG ACGATCAGCC TCCGTGTGCG CCAAGACAAT
ACCGTCTTAC ATGTGGAAGT CGCCGATAAC GGCATTGGGA TGCCGCCACA GATCAATGCC
GGTGGTGGGA GCCTCGGTAT GCAGTTGGTT CATGGACTCT GCCGCCAAAT TGGAGCAACG
CTGAGTTTTG GCGAAGGACC GGGGACAACC GTTATAATCA GGGTACCGAC GAGCAAACTA
GAAGCGTAG
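
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be normalized before its length or GC content can be compared against the reported 1149 bp and 56%. The following is a minimal sketch of that step; `normalize_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def normalize_sequence(text):
    """Remove all whitespace and upper-case a sequence shown in spaced blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a normalized DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGGGAAATG AGGCAACGTT AAGCCCAGCC GAAACAGCGC GCATACAGGC CCGTTTGGCC"
seq = normalize_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence block to the same two functions would give the values to compare with the Gene Length and GC content fields.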
 
Protein sequence
MGNEATLSPA ETARIQARLA ELEELVRALQ TGAADAIVID GPRGPLIYTL RGAEHPYRVL 
VETMNEGALT LLADGAILYC NSKFAEMVGL PQDQLTGRSL LDLVAPADRL LCAELLSAGA
AGSSKGPITL QAADGSQRPA QISLRALKDE TEAHMCAVVT DLSGPQAVAA QLRAALAEKE
LLLREVHHRV KNNLQIVSSL LRLQAEHISD ERVSAAVHDS QNRIRALALV HEQLYRSESL
AHIDIGEYLQ NIATSVWRSL SIRGSPIRLV SEVINGISIN IDQAIALGLI VTELVSNSVK
HAFPTGTAQG TISLRVRQDN TVLHVEVADN GIGMPPQINA GGGSLGMQLV HGLCRQIGAT
LSFGEGPGTT VIIRVPTSKL EA
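
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Compact encoding of the codon table: codons are ordered with
# bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied as displayed.
prefix = "ATGGGAAATG AGGCAACGTT AAGCCCAGCC"
print(translate(prefix))  # prints "MGNEATLSPA"
```

The translated prefix matches the first block of the protein sequence, and the final nine bases, GAAGCGTAG, translate to the closing residues EA followed by the stop codon.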