Gene Cagg_2001 details

Gene Information

Locus tag: Cagg_2001
Symbol:
ID: 7266282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2447263
End bp: 2449053
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 58%
IMG OID: 643566832
Product: hypothetical protein
Protein accession: YP_002463325
Protein GI: 219848892
COG category:
COG ID:
TIGRFAM ID:
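
The coordinates and lengths listed above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming inclusive 1-based start and end coordinates and a coding sequence that ends in a single stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2447263, 2449053)
print(length)                     # 1791
print(protein_length_aa(length))  # 596
```

Both results match the Gene Length and Protein Length fields of this record.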


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.021099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCGGT GTTCCCTTGC GCGCATCCTT TTCATCTGCG GCCTACTGGT GGTCCTGATC 
GGTGAATATC GATCGGTTCC CAGCGTTGTG GCCCTCACCC AACCGCAACA GGTGACGGCA
TTTGGGATGA ATACCTACTT CAGCGGTCTG GAGCGCTTGC CGCAGAATCG TAACGATGAT
GTAATGGCGC TGATTGATGC TACTCGGTCG TTAGGGGCTG AATGGGCGCG GGAAGAGCTC
AGTTGGGCCA ATCTCGAACC GAGCAAGGGA CTCTTTACGT GGGAACTGAT GGATGCAGCC
CTGACGCAGA CTGCCAACGC CGGGTTAGGA ATTATCGGCA TGTTACTAAC AACACCGGCA
TGGGCGCGGG TTGGCGATTG CAGCAGCCGG ATCACGCGCA ACGGTGGCTC GCTCAATTAC
TGGTGCCCGC CGGCCAACCC GCAGGATTTC GCCGATTTTG TTCGCACGGT GGTTGAGCGT
TACAATGGTG ATGGCTACAA TGATGCGCCG GGTTCACCAA GGGTAGCGGC ATGGCAAATC
TGGAACGAGC CAAACAACTG GACAACCTGG CCGGGTGAAG CGCACGAATA CGGCGCACTG
TTGGTCGCCG GTTACACCGC GGCCAAAGCT GCCGACCCAA CCGCACTCGT TGCAACCGGT
GGGGTCTACG TTTTTGACGG TGGCACACGC ATAGGCGGAA ATCGTGATGG TCTCGAATTT
CTCGGTGCAG CATTTACGGC TGTTCCTAAT GCCCAGACGA GCTTTGATGC ACTGGCTATT
CATCCCTACA TGCCCGATAC CACGCCCGAT CGCGCCGGTC TGTATGGCTT AGTTACGCTC
TGGGGGCGGA TCAGTAATGT GCGAGGCTGG CTCAATGACA AGCGTGGCTC ACACGTGCCG
ATTTGGATCA GTGAGTTAGG CTGGTCAACC TGTACCTCTA CTGTAGACGT CTGTCACAGC
GAGCAAGAGC AAGCCGATTG GCTAGTGCGA AGCCACGGCA TCGCGCTCGC ATTAGGCGTG
CAGCATATCA ACTGGTTCCA GCTTGAAGAT AAGTTTGATA GCCCAGCCGG TGATCAGTGG
GGCAACATGG CCTTGCTGCG CAATCGGAGC CAAGGCTATA CCCGCAAGCT CGCTGCTCAT
GCCTACGCTA CCCTCACGGC GCAGCTTGGC AGGGCAACGT TTATTGGTTT TGGGCCATTA
CATAGCTACG TCCATCAGAA CAACGCCCTT ATCCCCGCCG CCCGTTACCA TCTCCGCTTC
CAAACCGCGA CCGGCGCGCT GGTTGATCTA TTGTGGACGA CCGGCAGCGC CGAGACGCGC
ACCATGCCGG TTGAGGCCGG TCGCAGCGCG CAACTGCTCA GCCGTGATGG GGCAACGCTG
CCGCTAACGA TCAGTAGCGG GCAGGCTCAA ATTCCGCTTA GTGGAACACC GGTCTATCTC
CGGCAAGACA CACCGCCACA ATTGGGTGTC ACACCGACCG ATGTCACCAT TCCAATGCTG
ACGACCGATC CGGCCACCAC GTATACGCTC TCGGTTAAGA ACCTCGGGTC GGGCAGTATC
GGTTGGACTG CCGTTGGTAG CGCGAGTTGG CTGACACTCA TAACGACGAA TGGGAGTGGT
CATCGTAGTA CCTTACAATA CCGCATCGAT CCCAACGGGT TGGCAACCGG TGATTATACG
ACAACCATCA CCGTCAATGC CGGGAGTGCC GGAGTGCAGA GTATCCCGGT CACGTTGCGC
GTCGTCAACA CTATCTACCG CGTCTATACA CCGCTAGTTA CCCGGCGTTG A
 
Protein sequence
MVRCSLARIL FICGLLVVLI GEYRSVPSVV ALTQPQQVTA FGMNTYFSGL ERLPQNRNDD 
VMALIDATRS LGAEWAREEL SWANLEPSKG LFTWELMDAA LTQTANAGLG IIGMLLTTPA
WARVGDCSSR ITRNGGSLNY WCPPANPQDF ADFVRTVVER YNGDGYNDAP GSPRVAAWQI
WNEPNNWTTW PGEAHEYGAL LVAGYTAAKA ADPTALVATG GVYVFDGGTR IGGNRDGLEF
LGAAFTAVPN AQTSFDALAI HPYMPDTTPD RAGLYGLVTL WGRISNVRGW LNDKRGSHVP
IWISELGWST CTSTVDVCHS EQEQADWLVR SHGIALALGV QHINWFQLED KFDSPAGDQW
GNMALLRNRS QGYTRKLAAH AYATLTAQLG RATFIGFGPL HSYVHQNNAL IPAARYHLRF
QTATGALVDL LWTTGSAETR TMPVEAGRSA QLLSRDGATL PLTISSGQAQ IPLSGTPVYL
RQDTPPQLGV TPTDVTIPML TTDPATTYTL SVKNLGSGSI GWTAVGSASW LTLITTNGSG
HRSTLQYRID PNGLATGDYT TTITVNAGSA GVQSIPVTLR VVNTIYRVYT PLVTRR
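
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. The sketch below is an assumption-laden illustration, not the method used to produce this record: it uses the standard codon assignments (which translation table 11 shares with the standard code for amino acids), it does not treat alternative start codons specially, and the helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table built from the compact NCBI layout (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence shown in this record.
print(translate("ATGGTTCGGT GTTCCCTTGC GCGCATCCTT"))  # MVRCSLARIL
```

Pasting the full gene sequence into `translate` and `gc_percent` allows the result to be compared with the protein sequence and the GC content reported above.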