Gene Cagg_1119 details


Gene Information

Locus tag: Cagg_1119
Symbol:
ID: 7268573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1376723
End bp: 1379815
Gene Length: 3093 bp
Protein Length: 1030 aa
Translation table: 11
GC content: 60%
IMG OID: 643565962
Product: hypothetical protein
Protein accession: YP_002462465
Protein GI: 219848032
COG category:
COG ID:
TIGRFAM ID:
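
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1376723, 1379815)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the listed Gene Length (3093 bp) and Protein Length (1030 aa).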


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000327751
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAACGACG ATACACCGAT GTTCGATATG GAAGAGCAGA CCACACCGCA CGCTCCGGTG 
GAATGCCTGG GGATGACCTT TGATAACGAT GATGCGCGGC GCGTATTCTT TCTCGACCGC
TTGCGCGCCG CCCTTGAGGA ACTGCACGCC AGGCTCGGTG GTGTCCCCTT CACGACCGTG
GCCGATGCCG TGGAACGGAT GAAGTCGCTC ACGCACTGGC CGATGGGTGA TGATGAACGG
CTCCGCGAAC TGGCCGAACA GATGCGCAAA GCCCACCGCT CGGCGCCCGC CACCGACCTG
CTCCGGCTCT GGAAGGATGC GGTCGGCTTT CCGCACGGGA AGATCGAGGA TATTCTAAAC
CTCTCCGATC CGCCGTATTA CACGGCATGC CCCAACCCGT TCATTGGCGA TTTCATCCGC
TCCTACGGCA AACCCTACGA CCCGCAGACC GACGACTACC GGCGGGAGCC GTTTGCGGCG
GACGTGAGCG AAGGCAAGAA CGACCCCATT TACAACGCGC ACTCCTACCA TACCAAGGTG
CCGCACAAGG CCATCATGCG GTACATCCTC CATTACACCG AACCGGGGGA TGTTGTCTTT
GACGGCTTCT GTGGCACGGG GATGACCGGC GTGGCGGCAC AACTGTGCGG CGACCGCACC
ACGGTGGAGT CGCTTGGCTA CAAGGTGGAT GATGCAGGCA CCATCTACCG GCCGGAACAG
GATGAAAGCG GCAAAACGGT CTGGACGCCG TTCTCGAAAC TGGGCGCGCG GCGCGCCGTG
CTCAACGACC TTTCGCCGGC GGCGACGTTC ATTGCCTACA ACTACAACAC CCCGGTGGAC
GTGCGCGCCT TCGAGCGCGA GGCCAACCGG ATACTGAAGG ACGTGGAAGC CGAATGCGGT
TGGATGTACG CCACCCTCGC CACTACCAAT GCGCACGAGG CGGCGATGTG GGCCGAGCGC
CTGCGCGCAT GCCGCACGGC CGACGACGCG CGGGCGCTCA TTGCGTCGAT TCCGAATCGC
GGGACGATCA ACTACACCGT CTGGTCGGAT GTGTTCGTCT GCCCGGAATG CACCGAGGAG
GTGGTCTTTT GGCAGGCGGC CGTCGACCAC GAGGCGGGCA AGGTGCGCGA TGCATTCCCC
TGCCCCCACT GCGGCGCCAT CCTCACCAAG CGCACGATGG AGCGTGCATG GGTGACGACG
TATGACCGGG CCCTCGGCCA GACCATTCGC CAAGCCAAAC AGGTGCCGGT GCGGATCACC
TACCGCGTGG GCAACACGCG CTACGAGAAA GCGCCCGATG CCTTCGACCT GGCGCTGATC
GCGAAGATCG AGGAGCTGGA CATTCCCTAC TGGTTCCCGA CCGACCGCAT GCCGGAGGGT
GAGGAGTCGC GTCGCAACGA TGATATTGGC CTCACCCACG TGCATCATTT TTATACCAAG
CGGAATTTGT GGGTGTTGGG GGCGGCTATA TATCGAGCTC TTGCGACAAA TCCACGGTTA
GGCGTATGGG TTACTTCAAC CATGATAAGA ACGACCAAAA TGTATAAGTA CATGCCTGTC
CTTAATAATG GCCAACTTAC CGACCGCCGT ACTGGAACGG TATCAGGCAC TCTTTATGTT
CCATCTATGG CTGATGAGAA TTGTCCATTA GACCTACTGG TTTCAAAAAT CCGTGATTTT
ACGTTCTCAA TATCCCGGAA CATAAATGCT GCTACTAGCA CGAATTCTGC AACTGATGTG
AATACGGGTA ACATCACGGT CGACTACATC TTCACCGACC CACCGTTCGG CGGCAACCTG
ATGTACTCCG AGCTGAACTT TCTATGGGAA GCGTGGCTGA AGGTGTTCAC CAACAACAAA
CCGGAGGCAA TTGAAAACCA GACCCAAGGG AAGGGGCTGG CCGAATACCA ACGCCTGATG
ACCGCCTGCT TCAAGGAGTA CTACCGGGTG CTCAAGCCGG GGCGCTGGAT GACGGTGGAG
TTTCACAACT CGAAGAACGC CGTCTGGAAT GCGATCCAGG AGGCGTTGCA GGCCGCCGGC
TTCGTGATCG CCGACGTGCG CACGCTCGAC AAGCAGCAGG GGTCGTTCAA GCAGGTCACC
AGCGCGAACG CCGTCAAACA GGATTTGATC ATCTCGTGCT ACAAACCCAA CGGTGGTTTG
GAAGCGCGGT TTGCCCACGA AGCAGGCACC GCAGCAGGCG TCTGGGACTT CGTCCGCACG
CATCTGAAGC ACCTGCCGGT GTTCGTCGCC AAAGACGGCA AGGCGGAAAT CATCGCCGAG
CGGCAGAACT ACCTGCTCTA CGACCGGATG GTGGCCTTCC ACGTCCAGCG CGGCGTCCTG
GTACCGCTCT CGGCGGCGGA GTTCTACGCC GGTCTCGCCC GGCGCTTCCC GGAGCGGGAC
GGCATGTACT TCCTGCCCGA TCAAGTCGTC GAGTACGACA AAAAGCGGCT GACGATCCGC
GAACTGCACC AACTGACCCT CTTCGTCACC GACGAAGCCT CGGCGATCCG GTGGCTCAAG
CAGCACCTTG AGCGCAAGCC GCAGACCTTC CAGGAGCTTC ACCCGCAATT CCTGAAAGAG
ATCGGCGGCT GGCAGAAGCA CGAAAAGAAG ATCGAGCTGC GGGAACTGTT GGAGCAGAAC
TTCCTGCGCT ACGACGGCCA GGGGCCGATC CCGGCGCAGA TCGTCTCGTG GATGAAGCAG
AGCGCCGAAC TGCGCAAGCT CATCGAGGAG GAGCGTGCGA CCGGCCGGGC GACTGAGGAG
AACGGCCAAC TCAGCACGCA ATCCGCCGTG CTGATCGCGC GGGCCAAAGA CCGCTGGTAC
GTGCCCGACC CGCACAAAGC CGGCGACTTG GAAAAACTGC GTGAGCGGGC CTTGCTGCGC
GAGTTTGATG CGTATTGTGA ATCTGGCGAG CGGCGCCTGA AGGAATTCCG CCTCGAAGCC
ATCCGCGCCG GCTTCAGAAA GGCATGGCAA GAGCGCGACT ACGACACCAT CATCGCGGTG
GCGCGCAAAA TCCCGGAGAA CGTTCTGCAG GAGGATCCTA AGCTGGTCAT GTGGTACGAC
CAGGCCATCA CCCGTTCGGG AGAGCAAGGA TGA
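
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch, assuming the text contains only the letters A, C, G, and T plus whitespace; `clean_sequence` and `gc_fraction` are hypothetical helper names chosen for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a whitespace-blocked sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used as a small example.
example = clean_sequence("GTGAACGACG ATACACCGAT")
print(len(example), gc_fraction(example))
```

Applying the same helpers to the full sequence text would give a value that can be compared with the GC content field in the Gene Information table.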
 
Protein sequence
MNDDTPMFDM EEQTTPHAPV ECLGMTFDND DARRVFFLDR LRAALEELHA RLGGVPFTTV 
ADAVERMKSL THWPMGDDER LRELAEQMRK AHRSAPATDL LRLWKDAVGF PHGKIEDILN
LSDPPYYTAC PNPFIGDFIR SYGKPYDPQT DDYRREPFAA DVSEGKNDPI YNAHSYHTKV
PHKAIMRYIL HYTEPGDVVF DGFCGTGMTG VAAQLCGDRT TVESLGYKVD DAGTIYRPEQ
DESGKTVWTP FSKLGARRAV LNDLSPAATF IAYNYNTPVD VRAFEREANR ILKDVEAECG
WMYATLATTN AHEAAMWAER LRACRTADDA RALIASIPNR GTINYTVWSD VFVCPECTEE
VVFWQAAVDH EAGKVRDAFP CPHCGAILTK RTMERAWVTT YDRALGQTIR QAKQVPVRIT
YRVGNTRYEK APDAFDLALI AKIEELDIPY WFPTDRMPEG EESRRNDDIG LTHVHHFYTK
RNLWVLGAAI YRALATNPRL GVWVTSTMIR TTKMYKYMPV LNNGQLTDRR TGTVSGTLYV
PSMADENCPL DLLVSKIRDF TFSISRNINA ATSTNSATDV NTGNITVDYI FTDPPFGGNL
MYSELNFLWE AWLKVFTNNK PEAIENQTQG KGLAEYQRLM TACFKEYYRV LKPGRWMTVE
FHNSKNAVWN AIQEALQAAG FVIADVRTLD KQQGSFKQVT SANAVKQDLI ISCYKPNGGL
EARFAHEAGT AAGVWDFVRT HLKHLPVFVA KDGKAEIIAE RQNYLLYDRM VAFHVQRGVL
VPLSAAEFYA GLARRFPERD GMYFLPDQVV EYDKKRLTIR ELHQLTLFVT DEASAIRWLK
QHLERKPQTF QELHPQFLKE IGGWQKHEKK IELRELLEQN FLRYDGQGPI PAQIVSWMKQ
SAELRKLIEE ERATGRATEE NGQLSTQSAV LIARAKDRWY VPDPHKAGDL EKLRERALLR
EFDAYCESGE RRLKEFRLEA IRAGFRKAWQ ERDYDTIIAV ARKIPENVLQ EDPKLVMWYD
QAITRSGEQG