Gene Cagg_1001 details


Gene Information

Locus tag: Cagg_1001
Symbol:
ID: 7268373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1234749
End bp: 1237493
Gene Length: 2745 bp
Protein Length: 914 aa
Translation table: 11
GC content: 63%
IMG OID: 643565849
Product: hypothetical protein
Protein accession: YP_002462354
Protein GI: 219847921
COG category:
COG ID:
TIGRFAM ID:
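
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon that is not counted in the protein. A minimal Python sketch of that arithmetic, under those two assumptions (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 1234749
end_bp = 1237493

# Length of an inclusive interval.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# so it does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

A remainder of zero indicates that the annotated span is a whole number of codons.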


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.724391
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCAG AGAATAGGTT GCGATTGCCG TACTGGTGGC GCGTGAGTTG GCCCTTCCTC 
TGGTTTTTGG TCTTGACGAT TCTCTGGACG TGGCCGCTAG CGATGGAGAT GACCACCGCC
TTGCCCGGCA GCGGGGCTGA TCCGTTGTTG CAGACGTGGA TACTGGCGTG GGATGGCCAC
GCGCTGTTGC ACCATCCGGC AAGCGTCTGG GATGCGCCAA TCTTTTTTCC GTATCCGCGC
ACGTTGACGT ACAATGACCA CCATCTGCTC TGGGCAGTGG TGATGTTGCC GGTCTTGGCG
TTCGGCCAGC CGGTGGCGGC GTACAACGCG CTGTTGTTGC TGAGTTTTGC CCTCAGCGGC
TGGGCGGTCT GTCTGCTCAC CCATGATATC CTCGCTGAGC ACACCGAAGA ACCGGCGGCG
ACCATCGGCG CGTTGCTGGC CGGTGGTTTG TTTGCCTTCA GTACCTACCG GATGGCGCAT
TTGGCGCATC TCAACTTGCT CCAAACGGCG TGGTTGCCAT TGGCGTTGTA CTGGCTGCGC
CAGATGGCGG TGCGGTCGGG CGGGGCGTTT TGGCGAGCGG CGGCGTTGAC CGGCGTGTTT
GCCGGTGTGC AAGCGGTGAC AGCGGTCTAC TACGCGCCGA TGGCGGCGCT GGCGGTGGGG
GTAGCGGCTT TCGTGTGGTT GTGGCCGGCG CGCTGGCGGT CGCCATTGGC ACGACGGGCG
GTGGGGCGCA GCATTGGTGG ATTGGCGCTC GCCGGCGCGG TGGCAGTGCT GATCGCGCTG
CCGTTTCTGC TGCCCTATAT GATGGTCTAT GCCTATCTCG GCATTGTGCG TTCGCCGGGG
GAGCTGGTGC GGTGGTCGGC GCCGCTGACG GCGTATCTGT CGGTGCCGGC GGGGAATGTG
CTGTATGGTG CCGGGTTGAC GCCGGTGCAG CCGGGCAGCG AGCAGGAGCT GCTGCTGTTT
CCGGGGGCGC TGGCGGTGGT GTTGGCCGGG ATCGGCGGCT GGCGGTGGTG GCGCGCGGCG
CGGCGTGACG GGTTGGCGCT GCTGTTGATC GGGGCGGCGG GCGTGGTGTT GTCGCTGGGG
GTGGCGTTGC GGGTGCGCAG CGATGAGGCG GGCGTGGTGG CGGCGTTGCC GTATGGCTGG
TTGTACGAGC GAGTGCCGGG GTTTAACGCG CTGCGGGTGC CGGCACGGTG GTGGATGCTT
GGCAGCTTGG CGGTCGCGGT GCTGGCGGGG ATGGGGGCGG CGTGGCTGTG GCGGGGATGG
GGGCGCTGGC TCGTGCCGGT GGTGGGGGTG TTGGCGCTGC TTGAACATCT GGTATGGCCC
ATCCCGCGGA TCGAGCTGCC CGCGCCGCCG CCGGTCTATC AATGGCTAGC GCAATCGTCG
CCGTCTGCTC ATCGCGTTGT GCTAGAATTG CCAGCGGAAG CAGCCGAAGA GGCGACCCCG
GTGCGCGCGT GGTACCAGTT CTTTCAAATC ACGCACTGGC GCACGTTAGT CAACGGTTAT
AGTGGGTTGA CCCCGGCAGG ATCGCTTGAT GTCGTGCGGC GATTGCGCCG GTTGCCGGAT
GATGATACGG TGCGATATAT AGCCCGCTTG GGCGTCGATA CACTGATTAT TCATCGTGAT
CGATACGATG AACCAGCCAA GTTGGCACAT CTCTTGCAGT GGGCACAGGC CACGCCGTGG
CTTGAACCGC AGGGTGTCTT TGCCGATGCT ATCGTTTATG CGATCAAACC CGATCCGTCG
TTAGAAACGC TGGTGTCAGC CGGTGATCGC ATCTATATCG ACAACAATGA TCGCATTCCG
GGTATGGTAG CGCTCACCTT AGCCCACCGC TGGCAGACAG CGGGCGCAAT GGTTTACGGG
CTGAAACGGT TGCGCTACTA CCCAGCGCTT GCCACCCCAC CGGATGGGCA GTTGTTTGAT
TACGTTGTCT TAGCACGCGG GACGGATCCG CGTCCGTTTG GCACAATGCC GGCGCTAGTG
CGGTGGCAAG AGGAGGGGGT GGCGGTCTAT GCCGTTCCTG CTGAGCTGTT GGCAACGCAG
GAGTTAGGCG CGCCGGATAT CGGCCAATTT CACCCGCGCC ATCCGGCGAC GTTGACGGTG
CGGTTGCGTG GTGATAGCGT GGAAGTTGGC CGGAGCCGGA TCGCCTTGCC TACCGTGGTG
GAAGCCGCGA CCCTGCTGCT TGATGTGGCG AGTCTGACCG AGCAAGAGGT GCAGGTGGGC
GAGACGAAGC AGCGGTTGGC TGCCGGCGGG CAGACGTTGA CTGTGCCGAT CAGCCGTGAT
CAACCGGTAA CGATCAGTGG CGACGCGGCG ACGTTCAGCA TCGTGCGTGT GCAGCTCTGG
CGGGGAATGC CGGCGCTGGA TGCAGTGGGT GGCGTGGCGT TGACGGTTGA GAGCGCGTTT
GCTGGCTCGC AGTTACAGTT GCTGGCGCGC TTGAGCAGGG CGACTGAAGT GACCCTTGAA
ATATACGGTT CAACACCGTG GTATGAAAAA CCGGTGCATT TGTTGACCGG CAAGCTGACA
ACGACTGATA GTACAGAGCC GTCAACCCTG ACCGTTGATT TGATTCAGCC ATCGGCGCCA
TGGCTTGAAT ATGCGGTGCC AGCGGTAGAT GGCCGCTACA TCGCTTATCT ACGGATTGCC
GGTCAAGCCA GCACGGACGG ATTGCCGGTT GCTAAGTTTA CCGTGCGCAA TGGGCAAGTG
GTTGATGCGC AGCCGTTGCC GGCCCCATTG ACGATTGTGC GCTGA
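
The GC content reported for this gene can be recomputed from the sequence above. The sketch below is one way to do it, assuming the sequence is pasted as shown (blocks of 10 bases separated by spaces and line breaks); `gc_content` is a hypothetical helper name, not something provided by the source database.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and newlines used for block formatting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First line of the gene sequence above, kept in its block format.
first_line = "ATGACAGCAG AGAATAGGTT GCGATTGCCG TACTGGTGGC GCGTGAGTTG GCCCTTCCTC"
print(f"{gc_content(first_line):.1%}")
```

Passing the full gene sequence instead of `first_line` gives the value for the whole gene, which can then be compared with the GC content field.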
 
Protein sequence
MTAENRLRLP YWWRVSWPFL WFLVLTILWT WPLAMEMTTA LPGSGADPLL QTWILAWDGH 
ALLHHPASVW DAPIFFPYPR TLTYNDHHLL WAVVMLPVLA FGQPVAAYNA LLLLSFALSG
WAVCLLTHDI LAEHTEEPAA TIGALLAGGL FAFSTYRMAH LAHLNLLQTA WLPLALYWLR
QMAVRSGGAF WRAAALTGVF AGVQAVTAVY YAPMAALAVG VAAFVWLWPA RWRSPLARRA
VGRSIGGLAL AGAVAVLIAL PFLLPYMMVY AYLGIVRSPG ELVRWSAPLT AYLSVPAGNV
LYGAGLTPVQ PGSEQELLLF PGALAVVLAG IGGWRWWRAA RRDGLALLLI GAAGVVLSLG
VALRVRSDEA GVVAALPYGW LYERVPGFNA LRVPARWWML GSLAVAVLAG MGAAWLWRGW
GRWLVPVVGV LALLEHLVWP IPRIELPAPP PVYQWLAQSS PSAHRVVLEL PAEAAEEATP
VRAWYQFFQI THWRTLVNGY SGLTPAGSLD VVRRLRRLPD DDTVRYIARL GVDTLIIHRD
RYDEPAKLAH LLQWAQATPW LEPQGVFADA IVYAIKPDPS LETLVSAGDR IYIDNNDRIP
GMVALTLAHR WQTAGAMVYG LKRLRYYPAL ATPPDGQLFD YVVLARGTDP RPFGTMPALV
RWQEEGVAVY AVPAELLATQ ELGAPDIGQF HPRHPATLTV RLRGDSVEVG RSRIALPTVV
EAATLLLDVA SLTEQEVQVG ETKQRLAAGG QTLTVPISRD QPVTISGDAA TFSIVRVQLW
RGMPALDAVG GVALTVESAF AGSQLQLLAR LSRATEVTLE IYGSTPWYEK PVHLLTGKLT
TTDSTEPSTL TVDLIQPSAP WLEYAVPAVD GRYIAYLRIA GQASTDGLPV AKFTVRNGQV
VDAQPLPAPL TIVR
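
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), that the sequence is given 5' to 3' on the coding strand, and that translation stops at the first stop codon. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codon table built in NCBI order: bases T, C, A, G at each codon position,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and newlines used for block formatting.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]  # KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; every full line holds
# 60 bases, so each line starts in frame.
print(translate("ATGACAGCAG AGAATAGGTT GCGATTGCCG TACTGGTGGC GCGTGAGTTG GCCCTTCCTC"))
# prints MTAENRLRLPYWWRVSWPFL
print(translate("GTTGATGCGC AGCCGTTGCC GGCCCCATTG ACGATTGTGC GCTGA"))
# prints VDAQPLPAPLTIVR
```

Both outputs match the corresponding ends of the protein sequence listed above.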