Gene Cagg_0758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0758 
Symbol 
ID7268077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp938889 
End bp940319 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content56% 
IMG OID643565609 
ProductVWA containing CoxE family protein 
Protein accessionYP_002462118 
Protein GI219847685 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.38799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGAC GAATTACCGA GTTTATTGCC GGGTTACGCG CTGCCGGCGT GAGAATTAGT 
GTTGCCGAAT CGGCCGATGC GTTGCGGGCA ATTGAGCAGG CCGGCATCAG CGATCGGAAT
GTCTTTCGTT TGGCGTTGCA GACGGCTCTG ATCAAGGAAC GACAAGACCA AGCCATCTTC
AACGAACTCT TTCCGCTCTA CTTTGGCAAA GACTCGCCGC CGCCGCTGCA GCAGGCCGGC
GGTGGTCAGC TCTCGCCCGA AGAGCAGCAG CAACTTATGC ATCAGTTGCA GCAGCTCCTT
GCCCAGTTTC CCCCCGGCCC GCTGAGCCAG CTTTTTCAGA GTATGGTTAG TGGTCAACCG
CTCAGTAATC AGCAGATTCG GGCGATGTTG GCCAACGTCT CGCCGCCTCA TCTGACCAAT
CCGCGCTACC GCGATTGGAT GGCGCGACAG GCAATGCGTG AATTGCAGAT GAATCGGCTG
CAGCAGATGT TGCGCCAATT GCTCGAACAG TTGCGCGCAC AAGGGATGCG CGAAGAGGCA
CTGCGGGCGA TTGAACAAGC TGCCCGTGAG AATCTCGCGA CGCTCGAACA GCAGATTGGT
CAGCAGGTTG CCCAACAGAT GCAAGAACAG GCCCAAGGTC AAGGGCCACG CCAGAGAAGG
GGGCTGCCAA GTGAGCGTGA ATTGCTCGAT ATGCCGCTCG AACAGCTCGA TGAGAGTCTG
TTGCCCGAAA TGCGTACCCT TGTGCGCAAA CTGGCTGCAC GTCTCCGGAC TCGACTGGCG
TTACGTCAGC GTCGTGGAAA GACCGGTACG CTCGATGCGA AGGCCACCAT CCGCACTAAT
CAGCGCTTCG GCGGCGTCCC GATGTTAGTA CGTCATCGCA AGCGTCATCT CAAGCCGAAG
CTGGTCATTC TGTGCGATCG CAGCGTGAGT ACCCAGCACG TCATGTCGTG TATGCTGTTG
ATGATCTACG CCCTGCACGA TCAGGTGAGC CGTACTCGCT CGTTTGCCTT CATCGACCGG
CTGTACGACA TGTCGCACTA CTTTACCGAA TCACGCCCCG AACAGGCAAT CACACAAGTA
TTGACCGAAA TTCGTCCTAC CCGCAGTTAT AGCACCGATC TCGGTAACGC TCTCGCCGAG
TTCTGCCGCG ATCAACTGCA TCTGGTTGAT CGGCGTACAA CAGTGATCGT GCTTGGTGAT
GGCCGTAACA ACGAGAATGA TCCGAATCTG CCGGCGTTTG AGCAGATTCG ACGGCGAGCG
CGGCGGATTG TCTGGTTTGC AACTGAAGAA CGATGGAAGT GGGGTGTCTA CGATCCCGGT
TCACTGAGCA GTGACATCTA CAAATATGCA CCGATGTGTG ATGCAATGCA TGAGGTGACG
ACGCTACGTC AGTTGGCAAC CGCAATTGAC CGACTGTTTC TACATCCGTG A
 
Protein sequence
MDRRITEFIA GLRAAGVRIS VAESADALRA IEQAGISDRN VFRLALQTAL IKERQDQAIF 
NELFPLYFGK DSPPPLQQAG GGQLSPEEQQ QLMHQLQQLL AQFPPGPLSQ LFQSMVSGQP
LSNQQIRAML ANVSPPHLTN PRYRDWMARQ AMRELQMNRL QQMLRQLLEQ LRAQGMREEA
LRAIEQAARE NLATLEQQIG QQVAQQMQEQ AQGQGPRQRR GLPSERELLD MPLEQLDESL
LPEMRTLVRK LAARLRTRLA LRQRRGKTGT LDAKATIRTN QRFGGVPMLV RHRKRHLKPK
LVILCDRSVS TQHVMSCMLL MIYALHDQVS RTRSFAFIDR LYDMSHYFTE SRPEQAITQV
LTEIRPTRSY STDLGNALAE FCRDQLHLVD RRTTVIVLGD GRNNENDPNL PAFEQIRRRA
RRIVWFATEE RWKWGVYDPG SLSSDIYKYA PMCDAMHEVT TLRQLATAID RLFLHP