Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3396 |
Symbol | |
ID | 7267136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 4122518 |
End bp | 4125745 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643568205 |
Product | TPR repeat-containing protein |
Protein accession | YP_002464676 |
Protein GI | 219850243 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCCC AGAGTACGAT CGGTCGCTGG AGTGTACGGG TGATTGAGGC TAGCTGGTTA TTAGCCCTGA CACTCGTGCC TATTTATTTT AACCTCTATT CGGCCCGCCA TTTTGAACCT GACAAGGCGG CTGTGTTGCG CTCACTTGCG TTGATCGGTT TCGCCGCCGG GTTAATTTGG TTACTTGATT GGCTTGGTCA GCGCCCTTCC AATACCCGAC CGGCGTCGTG GTGGACAACC GTGCATGCCT CACCGCTCTT CTGGCCAACT GTGGCCTACA CGGCGGTATT TGCTTTTACA ACGATAACGT CCATCACGCC CACAGTTAGC CTGTGGGGAT CATACCAACG CATGCAGGGG TTGTACACCT ACCTTTCCTA CGTGACGTTA GGGGTGTTAG TCGCCGTCGC CTTACGCACA CCGGCCCAAC GTGAGCGCTT TATCACAATA AGTCTGGCTG CTGCCACCGT CGTATCGGTG TACGGGATCA TGCAGCATTA TCAGCTTGAT CCGTTGCCGT GGCGCGGTGA TGTGATTGCC CGCGTCGCCT CAACGCTGGG GAATTCGATT TTCGTAGCAG CCTACTTGAT CATGATCGTG CCACTGGCAC TCTACCGACT GGGTGCGGCT CTGGCCGCTG CGCAGGTCGC CCCAAGAAGC GATAGGCCGG CAGCCGAATG GCGCTGGGCG TTGTCACGGG TTTCACTCTG GTTGAGCGGC ACCCTCCTGA TTATCGCGAT CTTGAAGTTT AGTGTCGCAA TTCGGACAAT TGACTTTCGT TATTGGTGGC TCTTCCCAGG AGCAATCATC TGTGCGACTG CCCTCTGGTG GCTACTAACA GCTCGTCGTA GCCAAGCACT CCCACGCTGG CCACTGGTTC TTACCCTCCT GTATCTGCTA GGCTTTGCAT TAGCGTTCGT TTTGAACGCA TCTGCCGGCG TACAGCAATT TGCATTACCC GACATCGCAG CAAATGCGCT TGATTGGAGC GTATGGCTCG GATTAGCGAT AGCTGCCTTA GTCATCGGCT ATGGGCTTTC CCTTATCGGT AATCCACCAC CCGAACCGTC ACGTTTGACG TGGCAGCTTA CGGCTGTTGC CAGCGCGGGT GTGTTGATTC TTGCCTTGCT GACCATCTTT TTCAGCCAGA GTCGCGGTCC GTGGATCGGA CTCGGCGCCG GACTATTTGT GTTCATCTCG TTGATGCTGT GGTACGGGCG ACAACGGTTA CGGGCAAATG GCAATCTTAG CGGTAGTCGT CTGCTGACTC AGGCATTGGC TGGATGGGTA GCGTTGATCG TCCTGATCGG TGGGTTTCTG ATCGTGTTTA ATCTGTCTGA TGCTCCCATC TTTACCCAAA TGCGCGAAAT TCCGTACCTC GGTCGTATGG GTCGACTGCT GGAAGTGGAT AGCGGTACCG GCTTAGTACG GCGGTTGATT TGGGTGGGAG ATGAGCACGG GCGTGGAACT ATCGGTCTGA TCACGAGCGA TCCACTGCGC CTCCTCATTG GGTGGGGGCC GGAGAGTATG TTCGTCGCAT TCAATCGTTT CTATCCACCG TCGCTGGCAC ATGTTGAAGC GCGTGGCGCT TCCCCCGATC GTTCACATCA GGCATTGCTC GATGAGCTTG TGACAAAAGG CTTGCTTGGC CTCGTTACCT ACCTCTGGTT GATCGGTAGC ATAGTCTGGT TTTGTCTTCT TCAATTGCGA CGACCGACGA GCTGGCAGTA CCAACTGTGC ATCATTGGTA TTTTGAGCGC AATCACCGCC CATGTGATCG AAGGGCTAAC CGGTATTCCC ATTGTGGCAA CCCTGATGAT GTTTTGGTTG TTGATAGGGT TGGCTATCGC AGCAGAGCGC ATCGAGCACG GGCATGCCCA ACGACCGGCC ACAATCCCTG AACCGGTCGT GGCGCAGCGG GCCGAACGCC CGGCTACGCG CCGAAATCAA CCGCGCCCGC CAGTACGTCG TCCGTCCAGT CGCCGTCCGG TTACCGGTAC GATAGGCACC GCCGCTCTGA TCGGTCTGGT TACTGCCGGC CTGATCTGGT GGCTGAACAT ACAGCCCATT TACGCCGACA TGCGCTTTCA ACAAGGTCAA AGCTATAGCG ACCAGGGTCA AGTTTCGATC AGCACCCAGA TCGCCACCCT CAACGAATAT ATCGCCACCA TCCGGGCTAA TCCGGGTGAA GACTTTTACT ACTTACACCT TGCCCGTAGC TTAATGTCAC TTGCCGATAC GCTTCGTCGC CAGGGTGTCA ATCTTGGTGA GGCCGGTCAA CCTCGTCTTG ACGCACTGTT ACGCCTCGAT GGGATAGAGG CGGCGACCGG ATTCGTCCAG CGTTCTTCGC CGTTAAGTTT ACTCGCCTAC GCAGAGGCAA CATTACAGCA CGCTCATCAG CTTAGTCCAC TCAATAAAGA CCATTACGCA AATTTAGGCA GAATCAATCT ATTCTGGTAC AGTTGGACAA ACGACGTGCA ACGATTATAT ACAGCGCTCA AGTGGTACGA GCGCGTCGCC GAGATTGCCC CTCAAGATGT TGCACTGATG AACGAGCGGG CCGGTGTTCT GATACAACTG GCCGAATATG CTACTATCAG CGGTGATACT GCACAAGCCG GTGCATTTTT TCAACAGGCC GATGAGTTGT TGCAGACTTC GGCTCAACTC GATCCACGCT TCGGTGACAC GTCTCTCCGA CGCGGTGATC TGATCCGTCT TCGCAGCGGT GATCTCGATA CGGCCACGAC ATTCTACCTC CGTGCTATCG AGCAAGCACC TCAGCAGATG GTTGACAACC TTGATCGGAT TACTCGCGCG TTGAACAGTC GCCCCGATCT GCTTAACCAA CTACGTAACG CTTTCACCGT ACAAGCGACG CGAGCAGAGC AGGTGTTAGC CGAAGCGCGG GAAAAACCGG AACGGGCATT CGAGCTACCC ACCCTTGAAA CACAAGCTGC AAACCTGTAC GCAGCCGTCG CCCGCCTTGC TGTGCAGACA AACGATCTTG CCGGCGCCAT TGAGCCGTAT GCACGGGCAG TGTCGATCCA ACCGGCAAAT GCTGCGTTGA GCGAGCAGTA TACGTTAGTT TTGAGTGAAA CACTGCAATA CGACGCCGCC CTCACCGAAG CGCGTCGCTT GCTGGCGGTT TTGCAAAGCA ATGGGCGTAC CGGCGAGATG GCGCGGATCG AGCAGTTGAT TGCGCTGATC GAACAGGTAC GAAAATAG
|
Protein sequence | MSSQSTIGRW SVRVIEASWL LALTLVPIYF NLYSARHFEP DKAAVLRSLA LIGFAAGLIW LLDWLGQRPS NTRPASWWTT VHASPLFWPT VAYTAVFAFT TITSITPTVS LWGSYQRMQG LYTYLSYVTL GVLVAVALRT PAQRERFITI SLAAATVVSV YGIMQHYQLD PLPWRGDVIA RVASTLGNSI FVAAYLIMIV PLALYRLGAA LAAAQVAPRS DRPAAEWRWA LSRVSLWLSG TLLIIAILKF SVAIRTIDFR YWWLFPGAII CATALWWLLT ARRSQALPRW PLVLTLLYLL GFALAFVLNA SAGVQQFALP DIAANALDWS VWLGLAIAAL VIGYGLSLIG NPPPEPSRLT WQLTAVASAG VLILALLTIF FSQSRGPWIG LGAGLFVFIS LMLWYGRQRL RANGNLSGSR LLTQALAGWV ALIVLIGGFL IVFNLSDAPI FTQMREIPYL GRMGRLLEVD SGTGLVRRLI WVGDEHGRGT IGLITSDPLR LLIGWGPESM FVAFNRFYPP SLAHVEARGA SPDRSHQALL DELVTKGLLG LVTYLWLIGS IVWFCLLQLR RPTSWQYQLC IIGILSAITA HVIEGLTGIP IVATLMMFWL LIGLAIAAER IEHGHAQRPA TIPEPVVAQR AERPATRRNQ PRPPVRRPSS RRPVTGTIGT AALIGLVTAG LIWWLNIQPI YADMRFQQGQ SYSDQGQVSI STQIATLNEY IATIRANPGE DFYYLHLARS LMSLADTLRR QGVNLGEAGQ PRLDALLRLD GIEAATGFVQ RSSPLSLLAY AEATLQHAHQ LSPLNKDHYA NLGRINLFWY SWTNDVQRLY TALKWYERVA EIAPQDVALM NERAGVLIQL AEYATISGDT AQAGAFFQQA DELLQTSAQL DPRFGDTSLR RGDLIRLRSG DLDTATTFYL RAIEQAPQQM VDNLDRITRA LNSRPDLLNQ LRNAFTVQAT RAEQVLAEAR EKPERAFELP TLETQAANLY AAVARLAVQT NDLAGAIEPY ARAVSIQPAN AALSEQYTLV LSETLQYDAA LTEARRLLAV LQSNGRTGEM ARIEQLIALI EQVRK
|
| |