Gene Cagg_0834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0834 
Symbol 
ID7268286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1036954 
End bp1038603 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content55% 
IMG OID643565684 
Productprotein of unknown function DUF344 
Protein accessionYP_002462193 
Protein GI219847760 
COG category[S] Function unknown 
COG ID[COG2326] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGATC GCTGTATTAC CGATGTGTCG CTCTCTAAAG CGGAGTACCA GCGTTTAGTC 
CCTGAGTTGC AAGCGCGGCT CTTTGATCTG GAGCAGATGC TGCTCGAAGC GCGCATCCCG
ACCATTTTCG TGTTTGAGGG CTGGGCCGGA ACGGCCAAGG CGCGCACGAT TGCGACACTT
ACCCGTCGGC TTGATCCGCG TGGTTTTCGG GTGTATCCGA TCACGCCGCC ACGAACCTAC
GAGCAGCAGT ATCCGTGGCT CTATCGCTTC TGGCTCAAGA TTCCCAGCTA TGGTCAGATG
ACATTCTTTG ATCGGTCGTG GTATCGTGAA TTGCTGGCTG CCTATACGAC CGACGGTGAT
CAAGATCGTT GGCGGACGCG CTGCGAAGAT GCGGTTGTTT TCGAGCGCCA ATTGGCCGAT
GATGGGGCAT TCATCCTTAA GTTTTGGCTC CATATTACCA AAAAGCAGCA GGCTCGTCGC
TTTAAGAAGT TGTTGTCCGA TCCGTTGCAG TCGTGGCGGG TAACCGATGA GGATCGTTGG
CAACACCGTC ACTACAAGCG TGTCTACCGC GTAGTCGAGG AGATGCTGGT ACGCACCGAT
ACCGCGTTTG CACCGTGGCA AATTGTTCCG GCGGCCGATA AATACTATGC GCGTTTGTAC
ATTTTGCAGA CGATTGTCGG TGCGCTGGAA AGTCGCTTGG GCATCACTGC GATTGATCGG
GGCGCCAGTA TTGATGATAG TGGTGAAGCA CTCCGCCGCT ACAACTTGTC GATCCGAATA
CCGGTGCTGG GTGGTGCGAC CAACACAGAC ACGGTTCGTC CGTCACCATC GGCGGAAGAG
GCCGGTCATC AGCTCACCAC AACACCAATG TCAAACGGAT CGGTGGTGGT AACGGTCCCG
GTAGTGTCAC CAACCTACGC GGCTAGTCCG TTGCAACGGG TTGATCTCAG TCTGCGCCTC
GACGATGAGA CCTATCATCG TGAGTTGAAA CGGTTGCAGG CTAAGCTGTA CTTGCTAGGG
TTGCAAGTCT ACCATCAGAA ACGACCGGTG GTGATAGTGT TTGAGGGGTG GGATGCCGCC
GGTAAAGGTG GGGCGATCCA GCGTCTGACT GCTGAACTCG ATCCACGGGC GTATATTGTG
CATGCGATTG CAGCACCAAC CGGCGATGAC AAAGCGCGCC ACTACCTCTA CCGCTTTTGG
CGACGCTTGC CACCGCGTGG TCAGTTTGCG GTGTTCGATC GCTCGTGGTA CGGTCGGGTC
TTGGTTGAGC GGGTCGAAGG GTTCGCGCGG CCTGAGGAAT GGCGACGGGC CTACGCCGAA
ATTAATCAGT TTGAACGTCA GTTGGTCGAT TTCGGCACTA TCATCGCGAA GTTTTGGTTA
CACATCAGCC CTGAAGAGCA GTTACGTCGG TTTGAGCAAC GACAGAATGT GCCGTACAAA
GCGTGGAAAT TGACCGACGA AGATTGGCGT AATCGTGAGA AGTGGCCGGC GTATCTCGCG
GCAGTTGATG AGATGTTACT GCGCACCAGT ACACCATTTG CCCCGTGGAC GATAGTTGAA
GCGGAGGATA AGAAGTTTGC TCGGATCAAG GTGTTACGGA CAGCGGTTGA TGTATTAGAG
TCTGAGTTGG GAGTTGTAAA GCTGGAGTAG
 
Protein sequence
MLDRCITDVS LSKAEYQRLV PELQARLFDL EQMLLEARIP TIFVFEGWAG TAKARTIATL 
TRRLDPRGFR VYPITPPRTY EQQYPWLYRF WLKIPSYGQM TFFDRSWYRE LLAAYTTDGD
QDRWRTRCED AVVFERQLAD DGAFILKFWL HITKKQQARR FKKLLSDPLQ SWRVTDEDRW
QHRHYKRVYR VVEEMLVRTD TAFAPWQIVP AADKYYARLY ILQTIVGALE SRLGITAIDR
GASIDDSGEA LRRYNLSIRI PVLGGATNTD TVRPSPSAEE AGHQLTTTPM SNGSVVVTVP
VVSPTYAASP LQRVDLSLRL DDETYHRELK RLQAKLYLLG LQVYHQKRPV VIVFEGWDAA
GKGGAIQRLT AELDPRAYIV HAIAAPTGDD KARHYLYRFW RRLPPRGQFA VFDRSWYGRV
LVERVEGFAR PEEWRRAYAE INQFERQLVD FGTIIAKFWL HISPEEQLRR FEQRQNVPYK
AWKLTDEDWR NREKWPAYLA AVDEMLLRTS TPFAPWTIVE AEDKKFARIK VLRTAVDVLE
SELGVVKLE