Gene Information |
Locus tag | Cagg_1119 |
Symbol | |
ID | 7268573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 1376723 |
End bp | 1379815 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643565962 |
Product | hypothetical protein |
Protein accession | YP_002462465 |
Protein GI | 219848032 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000327751 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACGACG ATACACCGAT GTTCGATATG GAAGAGCAGA CCACACCGCA CGCTCCGGTG GAATGCCTGG GGATGACCTT TGATAACGAT GATGCGCGGC GCGTATTCTT TCTCGACCGC TTGCGCGCCG CCCTTGAGGA ACTGCACGCC AGGCTCGGTG GTGTCCCCTT CACGACCGTG GCCGATGCCG TGGAACGGAT GAAGTCGCTC ACGCACTGGC CGATGGGTGA TGATGAACGG CTCCGCGAAC TGGCCGAACA GATGCGCAAA GCCCACCGCT CGGCGCCCGC CACCGACCTG CTCCGGCTCT GGAAGGATGC GGTCGGCTTT CCGCACGGGA AGATCGAGGA TATTCTAAAC CTCTCCGATC CGCCGTATTA CACGGCATGC CCCAACCCGT TCATTGGCGA TTTCATCCGC TCCTACGGCA AACCCTACGA CCCGCAGACC GACGACTACC GGCGGGAGCC GTTTGCGGCG GACGTGAGCG AAGGCAAGAA CGACCCCATT TACAACGCGC ACTCCTACCA TACCAAGGTG CCGCACAAGG CCATCATGCG GTACATCCTC CATTACACCG AACCGGGGGA TGTTGTCTTT GACGGCTTCT GTGGCACGGG GATGACCGGC GTGGCGGCAC AACTGTGCGG CGACCGCACC ACGGTGGAGT CGCTTGGCTA CAAGGTGGAT GATGCAGGCA CCATCTACCG GCCGGAACAG GATGAAAGCG GCAAAACGGT CTGGACGCCG TTCTCGAAAC TGGGCGCGCG GCGCGCCGTG CTCAACGACC TTTCGCCGGC GGCGACGTTC ATTGCCTACA ACTACAACAC CCCGGTGGAC GTGCGCGCCT TCGAGCGCGA GGCCAACCGG ATACTGAAGG ACGTGGAAGC CGAATGCGGT TGGATGTACG CCACCCTCGC CACTACCAAT GCGCACGAGG CGGCGATGTG GGCCGAGCGC CTGCGCGCAT GCCGCACGGC CGACGACGCG CGGGCGCTCA TTGCGTCGAT TCCGAATCGC GGGACGATCA ACTACACCGT CTGGTCGGAT GTGTTCGTCT GCCCGGAATG CACCGAGGAG GTGGTCTTTT GGCAGGCGGC CGTCGACCAC GAGGCGGGCA AGGTGCGCGA TGCATTCCCC TGCCCCCACT GCGGCGCCAT CCTCACCAAG CGCACGATGG AGCGTGCATG GGTGACGACG TATGACCGGG CCCTCGGCCA GACCATTCGC CAAGCCAAAC AGGTGCCGGT GCGGATCACC TACCGCGTGG GCAACACGCG CTACGAGAAA GCGCCCGATG CCTTCGACCT GGCGCTGATC GCGAAGATCG AGGAGCTGGA CATTCCCTAC TGGTTCCCGA CCGACCGCAT GCCGGAGGGT GAGGAGTCGC GTCGCAACGA TGATATTGGC CTCACCCACG TGCATCATTT TTATACCAAG CGGAATTTGT GGGTGTTGGG GGCGGCTATA TATCGAGCTC TTGCGACAAA TCCACGGTTA GGCGTATGGG TTACTTCAAC CATGATAAGA ACGACCAAAA TGTATAAGTA CATGCCTGTC CTTAATAATG GCCAACTTAC CGACCGCCGT ACTGGAACGG TATCAGGCAC TCTTTATGTT CCATCTATGG CTGATGAGAA TTGTCCATTA GACCTACTGG TTTCAAAAAT CCGTGATTTT ACGTTCTCAA TATCCCGGAA CATAAATGCT GCTACTAGCA CGAATTCTGC AACTGATGTG AATACGGGTA ACATCACGGT CGACTACATC TTCACCGACC CACCGTTCGG CGGCAACCTG 
ATGTACTCCG AGCTGAACTT TCTATGGGAA GCGTGGCTGA AGGTGTTCAC CAACAACAAA CCGGAGGCAA TTGAAAACCA GACCCAAGGG AAGGGGCTGG CCGAATACCA ACGCCTGATG ACCGCCTGCT TCAAGGAGTA CTACCGGGTG CTCAAGCCGG GGCGCTGGAT GACGGTGGAG TTTCACAACT CGAAGAACGC CGTCTGGAAT GCGATCCAGG AGGCGTTGCA GGCCGCCGGC TTCGTGATCG CCGACGTGCG CACGCTCGAC AAGCAGCAGG GGTCGTTCAA GCAGGTCACC AGCGCGAACG CCGTCAAACA GGATTTGATC ATCTCGTGCT ACAAACCCAA CGGTGGTTTG GAAGCGCGGT TTGCCCACGA AGCAGGCACC GCAGCAGGCG TCTGGGACTT CGTCCGCACG CATCTGAAGC ACCTGCCGGT GTTCGTCGCC AAAGACGGCA AGGCGGAAAT CATCGCCGAG CGGCAGAACT ACCTGCTCTA CGACCGGATG GTGGCCTTCC ACGTCCAGCG CGGCGTCCTG GTACCGCTCT CGGCGGCGGA GTTCTACGCC GGTCTCGCCC GGCGCTTCCC GGAGCGGGAC GGCATGTACT TCCTGCCCGA TCAAGTCGTC GAGTACGACA AAAAGCGGCT GACGATCCGC GAACTGCACC AACTGACCCT CTTCGTCACC GACGAAGCCT CGGCGATCCG GTGGCTCAAG CAGCACCTTG AGCGCAAGCC GCAGACCTTC CAGGAGCTTC ACCCGCAATT CCTGAAAGAG ATCGGCGGCT GGCAGAAGCA CGAAAAGAAG ATCGAGCTGC GGGAACTGTT GGAGCAGAAC TTCCTGCGCT ACGACGGCCA GGGGCCGATC CCGGCGCAGA TCGTCTCGTG GATGAAGCAG AGCGCCGAAC TGCGCAAGCT CATCGAGGAG GAGCGTGCGA CCGGCCGGGC GACTGAGGAG AACGGCCAAC TCAGCACGCA ATCCGCCGTG CTGATCGCGC GGGCCAAAGA CCGCTGGTAC GTGCCCGACC CGCACAAAGC CGGCGACTTG GAAAAACTGC GTGAGCGGGC CTTGCTGCGC GAGTTTGATG CGTATTGTGA ATCTGGCGAG CGGCGCCTGA AGGAATTCCG CCTCGAAGCC ATCCGCGCCG GCTTCAGAAA GGCATGGCAA GAGCGCGACT ACGACACCAT CATCGCGGTG GCGCGCAAAA TCCCGGAGAA CGTTCTGCAG GAGGATCCTA AGCTGGTCAT GTGGTACGAC CAGGCCATCA CCCGTTCGGG AGAGCAAGGA TGA
|
Protein sequence | MNDDTPMFDM EEQTTPHAPV ECLGMTFDND DARRVFFLDR LRAALEELHA RLGGVPFTTV ADAVERMKSL THWPMGDDER LRELAEQMRK AHRSAPATDL LRLWKDAVGF PHGKIEDILN LSDPPYYTAC PNPFIGDFIR SYGKPYDPQT DDYRREPFAA DVSEGKNDPI YNAHSYHTKV PHKAIMRYIL HYTEPGDVVF DGFCGTGMTG VAAQLCGDRT TVESLGYKVD DAGTIYRPEQ DESGKTVWTP FSKLGARRAV LNDLSPAATF IAYNYNTPVD VRAFEREANR ILKDVEAECG WMYATLATTN AHEAAMWAER LRACRTADDA RALIASIPNR GTINYTVWSD VFVCPECTEE VVFWQAAVDH EAGKVRDAFP CPHCGAILTK RTMERAWVTT YDRALGQTIR QAKQVPVRIT YRVGNTRYEK APDAFDLALI AKIEELDIPY WFPTDRMPEG EESRRNDDIG LTHVHHFYTK RNLWVLGAAI YRALATNPRL GVWVTSTMIR TTKMYKYMPV LNNGQLTDRR TGTVSGTLYV PSMADENCPL DLLVSKIRDF TFSISRNINA ATSTNSATDV NTGNITVDYI FTDPPFGGNL MYSELNFLWE AWLKVFTNNK PEAIENQTQG KGLAEYQRLM TACFKEYYRV LKPGRWMTVE FHNSKNAVWN AIQEALQAAG FVIADVRTLD KQQGSFKQVT SANAVKQDLI ISCYKPNGGL EARFAHEAGT AAGVWDFVRT HLKHLPVFVA KDGKAEIIAE RQNYLLYDRM VAFHVQRGVL VPLSAAEFYA GLARRFPERD GMYFLPDQVV EYDKKRLTIR ELHQLTLFVT DEASAIRWLK QHLERKPQTF QELHPQFLKE IGGWQKHEKK IELRELLEQN FLRYDGQGPI PAQIVSWMKQ SAELRKLIEE ERATGRATEE NGQLSTQSAV LIARAKDRWY VPDPHKAGDL EKLRERALLR EFDAYCESGE RRLKEFRLEA IRAGFRKAWQ ERDYDTIIAV ARKIPENVLQ EDPKLVMWYD QAITRSGEQG
|
| |
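The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene span includes the stop codon (the listed gene sequence ends in TGA). The helper names `span_length` and `expected_protein_length` are hypothetical, introduced only for this illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 1376723
end_bp = 1379815
reported_gene_length_bp = 3093
reported_protein_length_aa = 1030


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the span includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints "3093 1030"
```

Under these assumptions the computed values agree with the reported Gene Length (3093 bp) and Protein Length (1030 aa).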