Gene Cagg_0159 details


Gene Information

Locus tag: Cagg_0159
Symbol:
ID: 7269626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 209510
End bp: 210562
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 58%
IMG OID: 643565031
Product: SH3 type 3 domain protein
Protein accession: YP_002461546
Protein GI: 219847113
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
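
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record: the function names are hypothetical, and it assumes inclusive 1-based coordinates and that the gene length includes the stop codon while the protein length does not.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(209510, 210562)
print(length, protein_length(length))  # 1053 350
```

Under these assumptions the result agrees with the listed Gene Length (1053 bp) and Protein Length (350 aa).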


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.333454
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0117135
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAACG AACCACAACC GAATCAATCA TCACCTGAAC CATCACCAAC GTCACGCGAG 
ACTCGTCGGC TCACCGCACC GCGGTTACCA CGCGCTGCCG AGCCTAAAGC GTCACGCACG
CCTTCACCGA GTGAGATTGA TGAGTTGATC AATGCACTTG GTGATCCGAA TCATCCACGT
CACACGGTTG CCGTTGATGA ATTGGTCGCG ATTGGGCCTG CTGCCGTTCC GGCGCTCTGT
GCCGTTGTTG GACCACATCA GCCGTGGTTG ACGGTCTACC GCGCGACCGA AGTGCTCGCC
CAGATCGGTG ATGGTCGCGC GACCGGCCCT TTGATTGCGG CCCTGAACCA TCAAAACGCG
AATGTCCGCT GGGGGGCCGT GCGCGCACTC GCGCAAGTCG GTGATGTGCG GGCACTGTTT
GCGCTCCGCA AAGTTGTCCA GACCGATCAG GGTCGTACCA GTTGGGGCGA ATCGGTTGCC
GGAGTAGCTC AGAGTGCGCT TGATCTGCTG AATCGGCGCA GTATTTGGTC GCAGAGTCTT
GAATTGATCA AACTAGCGAT TGTGAGTGTG ATCTTCTTAC TCTCGATGGC GCTGGCCTTC
GGCGTGATCG GCACGCTCCG CAATGAACTT GATCAATTTG GGCGCTACGT ACCAGGCCAA
ACCGAATTGC CGACCCTGGT CTTGCCGACC ACACGACCCA CCGCAACCCC GCGCCCGACG
CTTGCTGCCA ATCAAACAGT GGGTCCGCAG CCGACGACAC AGGTTATCAC CGGTACGGCA
CTGCAAGTGG CGAATGTGCG ACCGCTTCCC GGTACGAATA ACCAACCGAT TGGACGCATT
AACGCCGGTG ATGAGATTAT CTTTATTGCC CGCACTGCCA ACGGTCAGTG GTATCTGATC
CGACTCGGTA ATCAGCGGAG TCCCGACTCG TTTATCGCCA ATCCTGATGG TAGCGGGACG
GGGTGGGTTA ATCAGGCGTT GGTGTCGCCG CCATCGGCTG ATGTGCCGGT GCAAGAGCCG
TTGCCGGTTA CCGTGCCGAC AGCAACCCCA TAA
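
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. As a rough illustration of how a figure such as the GC content could be recomputed from it, here is a minimal Python sketch. The helper names are hypothetical, only the first line of the sequence is used as sample input, and the record does not say whether its GC value was derived this way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCAACG AACCACAACC GAATCAATCA TCACCTGAAC CATCACCAAC GTCACGCGAG"
seq = clean_sequence(first_line)
print(len(seq), seq.startswith("ATG"))  # 60 True
print(round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` in place of the single sample line gives the value for the whole gene.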
 
Protein sequence
MTNEPQPNQS SPEPSPTSRE TRRLTAPRLP RAAEPKASRT PSPSEIDELI NALGDPNHPR 
HTVAVDELVA IGPAAVPALC AVVGPHQPWL TVYRATEVLA QIGDGRATGP LIAALNHQNA
NVRWGAVRAL AQVGDVRALF ALRKVVQTDQ GRTSWGESVA GVAQSALDLL NRRSIWSQSL
ELIKLAIVSV IFLLSMALAF GVIGTLRNEL DQFGRYVPGQ TELPTLVLPT TRPTATPRPT
LAANQTVGPQ PTTQVITGTA LQVANVRPLP GTNNQPIGRI NAGDEIIFIA RTANGQWYLI
RLGNQRSPDS FIANPDGSGT GWVNQALVSP PSADVPVQEP LPVTVPTATP