Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3737 |
Symbol | |
ID | 7267810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4550192 |
End bp | 4553182 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643568544 |
Product | hypothetical protein |
Protein accession | YP_002465009 |
Protein GI | 219850576 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.16903 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAACAC TCGTGTTTGA ACTGATTTTG GTTTTACTTG TTGCTAGTCT CCTCGCGTTC TGGAATGGAT GGGGACTGGC ACGTTTGTTG CTCCCGGCGG CCGTTGCGCC CTGGCGAGCG TTATTGTCAC CACTCCTTGG CTATGCGCTG ACGATCCTCG TTGGTTACTG GGTGGTACGG TTTATCGGTG GTTTAGGATG GGCGTTGGGA TTGATCGCTC TCCTGAGCGG ATGGTTCAAT TGGCTCGCAT GGCGATGGTA TGGTCCACCT CAGATCATCA TGGCACTACA CCATCATTGG CCGGGATTGG TAGTAGCCGG TATTGCGGTT GCGTTTGGAG TTGCACCACT ATTGAGTTAT GGCTATGCTG CGCCCATCGG TGGCGGCTGG GATATTGAGA ATTACTGGCC TACTGCGCGC TATCTGGTCC GTGGTCCGGT GAGTGCCATC GCAACGGCAC CACCAAACCC CTTGCGCGAT ATCAATGCTG ATCCGCCGCG AATTGGCTTG ACCCTTGGTT TTAGTATTTG GCAAGGTAGT GTTGATCTCC TGAGTGGGAG TGAACCCCTC GTTAGTTTCG CACCATTGCT TGCGTGGCTA CGTGCGCTTG GCGTCGTCGG GATATATGTG CTCTTACAGG CGGTGTTTAC CCTGCGGCGT GGGCCGGCAG CGTTTGCGGC TCTTCTGGCG GCACTTAACG GTCTGTTGCT CTGGACGAGC TACTTCAATT TCGGGATGCA ATTAGCAGCA TGGCCGCTGC TGCCGTTGGT GCTAACGCTT GGTTTAGCAA CTGTGCGTGC CGATGCCGAA CGTTGGGCAA CCCAATCGTT GGTCGGTCGA GTTGCCCACT ATTGCGCTGC TGCCATGAGT TTGGCGGCTA TACCGGTTGC CTACTATCCA GCCCTTGGCC CGCTGGGATT GATGGCAGTG GGGATTGGAA TCGTTGTGCT TGGGCAGACA CGCGACCGTT GGCGGCTGAT TAAACGGTCG CTGGTACTTC TTCTATTGAC GCTGGCACTG GCTGCACCGA CGATACCCGA TTATTTTGCC GGCTTCAACT ACCGTTACAG CCTGCCACTT ACGACGCTTG GTCTCTTTCG TTTTATTCCA CTGAGCGACA TTGTTGGTTT TACCATCTTC CGCCTACGTG AGGTGTCCGA ACCACTAACA GTACCCGCCT GGGTAGCAGC AGCGATGCTC ATTGTTCTCG TCGGATATGG CTTGATCACC AGCCCCCAAC GTTGGTACTG GGTTGGGATG TTGGCCGGCG CAAGTAGCTT TTTACTCTAC CTGCGTTTTG GGGCTGTATA CCACTACGGC TATCTCAAAG CTGCGGCGTA TGTGGGATGG ATGGCCGGAG CACTAGCAGC AAGTGGTTTG CAAGCTATGC TCGATCATCT GCGTAGTCGT TCGGGGTGGC AACGAGTGTT GGTTACCGGT AGTGTGACCG TCTTGATCGG TAGTCCGGTG GCATTGACGG CTATTCGTGT AGTCGCCGAT CATTGGGGTA AACCTGCCTT ATTTGCCGAT CAATTACCGG TATTACGCGA GCTGCGTCAG CTTGTTCCTG TCGGTAGCAC GGTGCGTCTT ACCGGTGATC CGCGGGTCGA GGGGGTGACG AGCGCGCTCG CAGCGTACCT TCTCGATCAC ACACGGGTGT GGGGCAATGT TCGTACCGGT TATGCTTCAT CGTCCGCCGG TGAATCCGAC GCTATTGCCG AATACGCGCT CTTACAACGG GATGAAGATC CGACCTTGTG GGGCTACACC GATCCGCCGA TCTGGAGCGG TGGATCATAT CGCCTGTATC GGCGTCCGTC TGAGACAGTG GCACATCTTA TCTGGCAGCA CATAGTAGAC AAGGAGGCCG TGACCCTGCC GATCATCGGC GAGCGTCTGG CTTCTGAGGA GACAGTTTTC GTTGCGGAGC GTGAGCCACG CTGGATCGCG TTACAAACCG CTAGTTTCAC GCCGGCTGAG CTTATCATCA ATGGTGAGCG ATATGGGGTA CCTGCCGGTC GTTCGCGTGT GGTTATTGGT CCGCTGCGAA AGGATCAAAA GCTCACCCTC CAAGCCGTCG ATCGGCCGGT TCAGATTCAG ACGGTGAGCT TGCTCACGAC GCGATCAAAG AGTCAAGTGT CCAGGTTCGC GCATAGTGTC ACGATCAAGG CTGCCAGTGT TGCAGATGGG TCGACGGTGG CAACAAGTAT TGACATGCTC AACGCGGACA GTGGTCCGGT GGTGGTTGCA CTTGAATTGT GGGAGCGCCG GCAAGGAATA CTGTTCGGCC GCTATGGATT GCGTGTTATG CCATCAGCAG AGGTACAACA GGCCACGGTA ACTCTCGATC TTGCGACCGG TGCAGCGCGT GCGCAAGATG CTGCCGGTGC TCCGATAGTA CTGGGCGTGG ACCAAGGTGC ATCTTTACCC GGCACCTATA TCGCTCGGCT GTGGGTTGGT ACTGACCAAC GCGCATTACT GACACCGGTT GATCTATTTA CCTTCACGAT CGATCGGCAA GGGGCAGTAA CAGTTGATTG GACGGCGCAA ACTTCCTTGT TGACGGCGCA AATTGAGCGA CCATTACAAC CGCTGAGCGT TCAGTTTGGT GATGATATGC TGCTCCGCGG TTACGATCTA AGCACGACAC AGGCTACACC CGGTGAGACG ATAGCGTTAA CACTCTGGTG GCAGGCATTG CGGGGTAATC TCGATGAACG GAGCATCATG GTTCACGTGC GCAACGGCCA AGACGAGCGT ATCATCGACG CCGATGGTCC ACCGGCGGGT GGTGGCCGGC CAACGAGTGT GTGGCAAGGC GGTGAGTTGA TTATCGATGA GCGACACATA ACGATTCCTA CCGATGTGCT ACCGGGTCAC TATTGGTTGG TTATCGGAAG TTACCGATGG CCCTCGCTCG CTCCAATCCC ACGGGTAGGT GCCGATGAGG CGGTGTGGCG TATTCCGATT GAGATTGTGG CACGGCGTTG A
|
Protein sequence | MITLVFELIL VLLVASLLAF WNGWGLARLL LPAAVAPWRA LLSPLLGYAL TILVGYWVVR FIGGLGWALG LIALLSGWFN WLAWRWYGPP QIIMALHHHW PGLVVAGIAV AFGVAPLLSY GYAAPIGGGW DIENYWPTAR YLVRGPVSAI ATAPPNPLRD INADPPRIGL TLGFSIWQGS VDLLSGSEPL VSFAPLLAWL RALGVVGIYV LLQAVFTLRR GPAAFAALLA ALNGLLLWTS YFNFGMQLAA WPLLPLVLTL GLATVRADAE RWATQSLVGR VAHYCAAAMS LAAIPVAYYP ALGPLGLMAV GIGIVVLGQT RDRWRLIKRS LVLLLLTLAL AAPTIPDYFA GFNYRYSLPL TTLGLFRFIP LSDIVGFTIF RLREVSEPLT VPAWVAAAML IVLVGYGLIT SPQRWYWVGM LAGASSFLLY LRFGAVYHYG YLKAAAYVGW MAGALAASGL QAMLDHLRSR SGWQRVLVTG SVTVLIGSPV ALTAIRVVAD HWGKPALFAD QLPVLRELRQ LVPVGSTVRL TGDPRVEGVT SALAAYLLDH TRVWGNVRTG YASSSAGESD AIAEYALLQR DEDPTLWGYT DPPIWSGGSY RLYRRPSETV AHLIWQHIVD KEAVTLPIIG ERLASEETVF VAEREPRWIA LQTASFTPAE LIINGERYGV PAGRSRVVIG PLRKDQKLTL QAVDRPVQIQ TVSLLTTRSK SQVSRFAHSV TIKAASVADG STVATSIDML NADSGPVVVA LELWERRQGI LFGRYGLRVM PSAEVQQATV TLDLATGAAR AQDAAGAPIV LGVDQGASLP GTYIARLWVG TDQRALLTPV DLFTFTIDRQ GAVTVDWTAQ TSLLTAQIER PLQPLSVQFG DDMLLRGYDL STTQATPGET IALTLWWQAL RGNLDERSIM VHVRNGQDER IIDADGPPAG GGRPTSVWQG GELIIDERHI TIPTDVLPGH YWLVIGSYRW PSLAPIPRVG ADEAVWRIPI EIVARR
|
| |