Gene Cagg_3303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3303 
Symbol 
ID7267777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4000407 
End bp4001477 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content58% 
IMG OID643568114 
Producthistidine kinase 
Protein accessionYP_002464587 
Protein GI219850154 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000380955 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCATCC TCTTTGCGCC AGGTATCCAC GACCGCTCGA TGATTGCTTG GTTGGGGCCA 
CAATGGGTGG AGACGGCGGA TGCGAGTATG GTCGAGATGG AGCGGGCAAC CAATGAGATC
TTCACCCTTG CCATGCTGCA AGCACTACTG ATCAGCAGTG GGGTCGCAAT CGCTGTCGCA
GTCCCGGTAA GTTGGGTGGC ATCGCAGGGG ATTGTTCGGC CTATCGAACG CCTGTTGGCG
ACTGCACAGC GGATTGCCGA CGGTTATTAC CACGAGCGTG TACCGCTCCA GGGTGAGCGT
GAGCTAGTGC GGCTGGCCGC GCAATTCAAT ACAATCGCGG CGGTGCTCGA GCAGGCCGAG
CAACGTCGCG TGGCTCTCAT CGGCGATGTC GCACACGAGT TGCGCACACC ATTGGCGACC
ATTGCCGGTT ATGTTGAAGG AGTGCTCGAT GGTGTCGTGG AGGCCGATGA AGACACGTGG
GTACTGGTGC TCGATGAGGT CAACCGGTTG CACCGATTGG CAGGTGATTT GCAGGAATTG
TCGCGGGTCG AGGCCAAACA GATCGTGTTA GCACGGCAAA ACGTTCATCT CGAACCCCTC
ATCGAAGCTA TCTGCGCGCG CTTAGAACCG CAGTTTACCG AAAAGGGTGT GCGATTACAG
GTGCAGCTTG CGAACAGTCT GCCATCGGTT TGGGTCGATC CTGATCGGAT CCTTCAGGTG
TTGATGAATC TCGTCGGCAA TGCGTTGCAG TACACGCCGC GTGGTGGTTC CGTCACGATA
CGGGCATTGG TAGTTGAGAA GATGGTACAG GTGTGCGTAC ACGATACCGG GATCGGGATT
GCCGCCGAAC ATCTACCGCA TCTCTTCGAG CGGTTTTATC GAGTTGACAA AGCACGCGCG
CGGGCTACCG GTGGCGCCGG TATTGGCTTG ACCATCTGCA AAGCGTTAGT GGAGTTGCAT
GGCGGGCAGA TTGGGATCCA TAGTGATGGC CCGGGGCAGG GGACGACGTG TTGGTTTACC
CTGCCCCTAT TCGACCACAA CGTACAGCGC TTGAGCGATG CGTTGGCCTA A
 
Protein sequence
MTILFAPGIH DRSMIAWLGP QWVETADASM VEMERATNEI FTLAMLQALL ISSGVAIAVA 
VPVSWVASQG IVRPIERLLA TAQRIADGYY HERVPLQGER ELVRLAAQFN TIAAVLEQAE
QRRVALIGDV AHELRTPLAT IAGYVEGVLD GVVEADEDTW VLVLDEVNRL HRLAGDLQEL
SRVEAKQIVL ARQNVHLEPL IEAICARLEP QFTEKGVRLQ VQLANSLPSV WVDPDRILQV
LMNLVGNALQ YTPRGGSVTI RALVVEKMVQ VCVHDTGIGI AAEHLPHLFE RFYRVDKARA
RATGGAGIGL TICKALVELH GGQIGIHSDG PGQGTTCWFT LPLFDHNVQR LSDALA