Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3033 |
Symbol | |
ID | 7266564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 3689794 |
End bp | 3690996 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643567854 |
Product | putative transposase IS891/IS1136/IS1341 family |
Protein accession | YP_002464328 |
Protein GI | 219849895 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000443442 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCGGTAG TACTCGCGCA CAACATTGAG CTTTGTTCGA TACAAGCGCA AGAAGTGTTC CTGCGCGAGG CAGTTGGTGT GGTGCGGTTT GTCGACAACT GGGTGCTCAA GGAGTGGCTA ACGCGTGACG AAGCGTACAA ACATGTGCTG CCCAACCCAC ACGAGGCAGC CCTGAGACGG CAACCGAACG CGATCAAGTG CATACCTAAG CGTGTACCGC AGTATGCCAT CACGAATCTG GACAAGGCAT GCAATCGCTT TTTCCGTAAG GAAAGCGGGT ATTCCACCTT CAAGAAGAAG GGTAGGCACG ACAGTGCCCG TTTTGACAAC GGGTTGGGCA TCGTTCGCTA TGCAGGTAAG CGTATCAACC TACCCTTTGT CGGGGGGGTA AAGATGCGTG AGGCGCTGCG CTTGATTGGC AAAACCCGGT CTGCCACGTG CGCGAGCGTG GCAGACCGTT GGTACGTTTC TCTCCCTGTC GAGGTTGAAG TGCTCAAGCC CATCCGCGAG AGCCAAGCAG CGGTGGGCAT TGATCTGGGC GTCACCACGG CGGCAACACT TTCAACCGGT GAAAGACTGG AAGGCCCGCA AGCGTTACCC AAGCATCTTA AGCGTCTGCG CCGTTTGAGC CATCACTATC GTTGCAAAGC CATCGGCAGC AACAACCGGC GCAAATCGGC GCGGTGCTTG GCGCGGTTTC ACGCGCGCAT TGCCACCATT TGGCGGGATT GGCTGCACGA CCGGCAGCTT GCCCGCGCGC TTGTCGACAT TAGGATGGAT AAGTTCAAGC GCCAGCTTCG CGACCAGACG GCGCTCTCTG GTGCGATGCT GGTGGAAGCC AATCCGTGGT TTCCCTCCTC GAAGATGTGT TTAGGGTGCG GTACCGTGGT GAAAAACTTG CCGCGTTCCG TTTGGGAATG GACGGGCGAC AGGTGTGGTA CGCACCATGA CTGCGATGTG AATGCGGCAA AAATCTTGAG TGCCCGGTAC TCACGAGGGG AGTTGCGCCG CCGTCACGCC TGTGGAGACC GGTCCTGCGG CGGGATAGCG CCGTACCGGT CTCTGCGTAT CTCGTCGGTG AAGAAGGAAT CAAACGTGTC CTCTTTGGGC ATGGTTCGAG CAACGGAAGG TAGGAATGAC CGATCAAGAA GGCGCCTCCG AACAAGTTAT CGAATTGCGC CTGCCGAGCC GTTTGGGGTA TGA
|
Protein sequence | MSVVLAHNIE LCSIQAQEVF LREAVGVVRF VDNWVLKEWL TRDEAYKHVL PNPHEAALRR QPNAIKCIPK RVPQYAITNL DKACNRFFRK ESGYSTFKKK GRHDSARFDN GLGIVRYAGK RINLPFVGGV KMREALRLIG KTRSATCASV ADRWYVSLPV EVEVLKPIRE SQAAVGIDLG VTTAATLSTG ERLEGPQALP KHLKRLRRLS HHYRCKAIGS NNRRKSARCL ARFHARIATI WRDWLHDRQL ARALVDIRMD KFKRQLRDQT ALSGAMLVEA NPWFPSSKMC LGCGTVVKNL PRSVWEWTGD RCGTHHDCDV NAAKILSARY SRGELRRRHA CGDRSCGGIA PYRSLRISSV KKESNVSSLG MVRATEGRND RSRRRLRTSY RIAPAEPFGV
|
| |