Gene Cagg_3075 details

Gene Information

Locus tag: Cagg_3075
Symbol:
ID: 7269492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3737980
End bp: 3738960
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 58%
IMG OID: 643567895
Product: Helix-turn-helix type 11 domain protein
Protein accession: YP_002464369
Protein GI: 219849936
COG category: [K] Transcription
COG ID: [COG2378] Predicted transcriptional regulator
TIGRFAM ID:
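
The coordinates and lengths listed above are consistent with one another. A minimal Python sketch of the arithmetic follows. It assumes the start and end positions are inclusive 1-based coordinates (the record does not state the convention) and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3737980
end_bp = 3738960

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in non-overlapping triplets (codons).
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no amino acid.
protein_length = codon_count - 1

print(gene_length)     # 981
print(remainder)       # 0
print(protein_length)  # 326
```

Under these assumptions the computed values match the listed gene length (981 bp) and protein length (326 aa).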


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00643592
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000271212
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGGTCGATA TTAGCTACCG TTTGCGCAGC AAAGCCGCAC GGTTGCGGAA CCTTGAACAC 
AAACTCTACA ACGCGCCACC GCAAGGCTGG AGCGTGATCG AGCTAGCCAA ACAATGCGGT
GTCAATCGCC GCACCATCTA CCGCGACCTT GAGGCGCTCA GCGCAGCGGG AGTACCGATC
TGGGAGCACA ACGGTAAGTA CGGCATTGAC CGCAACACCT ATCTCGCCAC CGTTCGGCTC
AACCTTCACG AAGCAGTCGC GCTCTTTTTT GCGGCACGAC TGCTTAGTCA CCACAGCGAC
GAACACAACC CACATATTGT TACGGCGCTG GATCAACTCG CTGCCGGTCT ACCCGATCAA
ACCATTGCCG GTCATATGGC GCGGCTGGCA AGCATTGTTC GGGAGCGACC ACCCAATCCA
CACTACGTTC ACACCCTCGA ACTGCTCACC CGCGCATGGG CCGACCGGCA GATGGTGCGT
ATCCGTTACC GTGCCCCCAA CCGACCGCTC ACCGAACGCG ACATTGCCCC CTATTTTCTC
GAAGTAGTAC GGACAACGCC GGGAGTTTAC GTGATCGCGT ATGATCGGCT ACGCAACGAT
CTGCGCACCT TCAAACTTGA ACGGATCGAG CACGCCCAAC TCCTTGACGA ACGGTTCGAC
ATTCCGGCAG CATTCGACCC GTATGAGCGA TTGGCACAGG CGTGGGAAGT CATGCACGAG
ACGGCAGTTG CCATCCACTT ACGCTTTAGC CCGGCTGTTG CCCCGCGTAT CCGTGAGACA
CGCTGGCATC ATAGTCAGCG CCTGATTGAC AATGCCGATG GTAGCTGTGA CTTGCACCTC
ACCGTTGCCG GCATCCGCGA AATCCTGGGC TGGGTGTTAA GCTGGGGGCC TGATGTGCAA
GTGTTGGCTC CACCCGAGTT GCGAGACACC GTGATCGACT ACGCCCGCCG TCTCTTAGCA
CGGTACCAGC AGGATTGGTA G
 
Protein sequence
MVDISYRLRS KAARLRNLEH KLYNAPPQGW SVIELAKQCG VNRRTIYRDL EALSAAGVPI 
WEHNGKYGID RNTYLATVRL NLHEAVALFF AARLLSHHSD EHNPHIVTAL DQLAAGLPDQ
TIAGHMARLA SIVRERPPNP HYVHTLELLT RAWADRQMVR IRYRAPNRPL TERDIAPYFL
EVVRTTPGVY VIAYDRLRND LRTFKLERIE HAQLLDERFD IPAAFDPYER LAQAWEVMHE
TAVAIHLRFS PAVAPRIRET RWHHSQRLID NADGSCDLHL TVAGIREILG WVLSWGPDVQ
VLAPPELRDT VIDYARRLLA RYQQDW
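
The gene and protein sequences above can be related with a short translation routine. The following is a minimal Python sketch, not the database's own pipeline. It assumes the sequence is given in the coding (5' to 3') orientation and builds the codon table from the NCBI amino-acid string for translation table 11, which has the same codon-to-amino-acid assignments as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence shown above.
print(translate("ATGGTCGATA TTAGCTACCG TTTGCGCAGC"))  # MVDISYRLRS
print(translate("CGGTACCAGC AGGATTGGTA G"))           # RYQQDW
```

The two printed fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.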