Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1354 |
Symbol | |
ID | 7268646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 1678391 |
End bp | 1679704 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643566197 |
Product | WD-40 repeat protein |
Protein accession | YP_002462697 |
Protein GI | 219848264 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0105276 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000574581 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGCACC GTTCTTCCTT CAACGAACCC ATTGAGTCGT CAGCTCCTGC GTCGTCAGGG TTGGAACATC TGCGACAACA ATTGCTGATA GCATATGATG GCGCATGTGC AATTAGTGGT TGTCAAGTTG ATGCTGTCTT GCGTCCAGTT TTGATCGATC CTAATGGGCC GGCTGAGCCG TATAATGCGT TGTTGTTGCG GGCCGATCTC CAGCAGCTTT TTCAATCCGG TCTGCTGACC ATCGATGCCA TGACCCTCAA AGTGCTTGTG GCGCCGGCGT TGCAGAATAG TGAATACAGA GCACTGGCAG GTAAACGGCT CCGTCAGCCG AAACGGTTGC CCCTTCGACC AAGCCGACAT ATGTTGGCTG CTCATCATCG CCTTTTTCAA CTTGAGCGAC CGTCTCTTGA CTCTTGTACG GCGCGACCGC CTTCTCTACA GCGATTATTG GCCGTGCAGA GTTGGGTGAA AACGCTGGCA TTTAGTCCCG ATCAGCAGAC CTTGGCGACC GGTTCTCTGG ATGGGAAACT ACGGCTTTGG CGGTGGTCTG ATGGTCAATT GCAGCGCGTG CTGAGCAGTA GGATTGATGA AATCAATGCA GTGGCTTTTA GTCCTGATGG CCAACGGATT GCGGCTGCAG GTCGTCAGGA TGGGGTACAG GTCTGGCGAG TAGCCGATGG AGAACCGCTC CTGTATCTCC GTAACGACCA ACGCCATGGA GCACTTTTTA GTGTAGCTTT TCAGCCCAAT GGTGATCTGA TCGCGGCTAC CGGCTGGGCA CCGGTTATCT GGCTGTGGAA TGCGACTGAT GGCAGCGTAA GTGGGGGCTT ATCCGGTCAC GAAGGCTTCA TCAATAGTTT GGCATTCCAC CCAGGTGGCG ACTTGCTCTT ATCGGGTGGC CAAGACCGGA TTGTCCGACT CTGGCGTATC CCCGATCGAT CGTTGGTTCG TGAGATGCAT GGTCACGATG ACGAGATTCT CAGTGTTGCA TTTAGCGCTG ATGGCGAATT AGCCGCCAGT GCAAGTGCTG ATGGGGTGAT TATTGTCTGG CAGGTCGCTC ATTGGCAACC GGTGCAGATG TTGCCTTCCT ATGCTGGAGC GTGTTCGAGT CTTGCGTTTA GTCCTGATAA TCGGTATTTG GCGAGCGCTC ATGATGGTCG GACTGTGCTC ATGTGGCAGG TAAGTAATGG AGAACTGCGT TGGGAACTGC GAGGTCATGG CGAACGTGTG ACGTGTGTGG CATTTGCACC GCGCGGGAAT GTCCTGGCGA GCGGGAGTTT TGATGCGGTA GTGCGAATTT GGGCGTATAA GTAA
|
Protein sequence | MQHRSSFNEP IESSAPASSG LEHLRQQLLI AYDGACAISG CQVDAVLRPV LIDPNGPAEP YNALLLRADL QQLFQSGLLT IDAMTLKVLV APALQNSEYR ALAGKRLRQP KRLPLRPSRH MLAAHHRLFQ LERPSLDSCT ARPPSLQRLL AVQSWVKTLA FSPDQQTLAT GSLDGKLRLW RWSDGQLQRV LSSRIDEINA VAFSPDGQRI AAAGRQDGVQ VWRVADGEPL LYLRNDQRHG ALFSVAFQPN GDLIAATGWA PVIWLWNATD GSVSGGLSGH EGFINSLAFH PGGDLLLSGG QDRIVRLWRI PDRSLVREMH GHDDEILSVA FSADGELAAS ASADGVIIVW QVAHWQPVQM LPSYAGACSS LAFSPDNRYL ASAHDGRTVL MWQVSNGELR WELRGHGERV TCVAFAPRGN VLASGSFDAV VRIWAYK
|
| |