Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2344 |
Symbol | |
ID | 7268694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2852676 |
End bp | 2853611 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643567173 |
Product | pentapeptide repeat protein |
Protein accession | YP_002463658 |
Protein GI | 219849225 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00177045 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCCGATT TGACACCTGA TCAGATCTAC GATTTACTCA ACCGGCCTGG CCCACTGTGG TTAGTGGGGG CAAATCTCAG TGGGGCGAAC CTGAGTGCCG CAAACCTCAG CGGGGCGAAT CTCAGTGAAG CCAAATTGAG TCGTGCGCGG CTGACCGACG CCAATCTCTA CCGAGCCGAT CTGAGTATTT GTGAGTTGGG TGAAGCAAAT CTGAGCTGGG CAAATCTCCG TGAAGCGAAA CTCAATTGGG CGCAGTTAGT GCGGGCCGAT TTGAGCGATG CCGATTTACG CAAAGCCGAC CTGAGCTGGG CCAATCTTGA GTTTGCAACG CTGATTGGGG CCAACCTACG TGGCGCCAAT CTGAGTGCGG CTGATTTCAG TGGTGCGAAT CTGTATGGCG CAAATTTAAG CCTCTGTAAC TTAAGTGGGG CCGATCTGCG TGATACGGTG ATGATCGGTG CCAATCTGAG CGAAGCCCAA CTACGTGAAG CGCAATTGGT TAACCTGAGT GGAGCGAACT TGAGTGGGGC GATCTTGCTT CGCGTCAGTT TAAACGGAGC AAACCTCAAC GGCGCGAATT TGGCCGGGGC TAACTTGATG CACGCTAATC TACGTGAGGC GACGCTCGAT GAGGTGAATT GTATCGGGGC AAATTTGAGT GAGACAAACC TTAGTGAGGC AAGTTTGTGC AATGCCGATT TTAGTGATGC TAACCTGAGT GGGATTTATC TCAGTGGGGC ACATTTACGT AACGCTATCT TCACGCGCGC GAATTTGTCG CGGGCTAACT TGAGCGGTGC CAATTTACGT GGTGCGAATC TTCGTGGGGT GAATCTCCGT GAGGCGAGTT TGGCCGATGC CGATTTGACC GACGCCGACT TGACCGACGC CGACTTAACC GATTGTGATC TGAGCGGTGC GAAGGGGTTA CGATAG
|
Protein sequence | MADLTPDQIY DLLNRPGPLW LVGANLSGAN LSAANLSGAN LSEAKLSRAR LTDANLYRAD LSICELGEAN LSWANLREAK LNWAQLVRAD LSDADLRKAD LSWANLEFAT LIGANLRGAN LSAADFSGAN LYGANLSLCN LSGADLRDTV MIGANLSEAQ LREAQLVNLS GANLSGAILL RVSLNGANLN GANLAGANLM HANLREATLD EVNCIGANLS ETNLSEASLC NADFSDANLS GIYLSGAHLR NAIFTRANLS RANLSGANLR GANLRGVNLR EASLADADLT DADLTDADLT DCDLSGAKGL R
|
| |