Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3706 |
Symbol | |
ID | 7268242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 4504225 |
End bp | 4505202 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643568513 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002464978 |
Protein GI | 219850545 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000558134 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000472149 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAACATA CCTTGTCGCC GATTGGCGTT GTCGTCGTTT CGTACAATAC GGCCCCCTTG TTGCGGCGTT GTCTGGCGTC GTTGCAGGCA TGCACGTTGC CACTGCGGAT TGTTGTGGTG GATAACGGCT CGCAAGATGA GAGCGTCGCG CTGGTGCGGC GTGAATTCCC GACGGTGACG GTGCGAGAAC GGCCGGATAA TCCCGGCTAT GCGGCTGCGT GTAACGAGGG TATCAGCCTA CTCGTAGATA CCTGTGGGGC GATTTTGGTG CTGAACCCCG ACACCGAGCT ATTACCGGGA GCAATTGAAG CGATGGCCGA TTTCCTTGCG GCGCATCCGC GCGTTGGGTT GGTGGGGCCG CGCTTGCTCA ACCCCGATCA CACACTTCAA CGAGCTGCCT TCCGTTTTCC CGATCTGATC ACGACGGCCC TCGATCTCTT CCCGCCCGGC GAAGTCTTGC CCGGTCGCCT CTACGACTCG TGGTGGCACG GACGTTACCC GGCTGAACTT GGTGATGCTC CCTTTCCGAT TGACTACCCC CTTGGCGCCT GTATGATGGT ACGCAGCGCT ACCATTGGCG AAGTGGGGCT TATGGACGAA GACTATTTTA TGTACTGTGA AGAGATCGAC TGGTGCCGAC GGATCAAGCA GGCCGGATGG GCAATCTGGC AGGTTCCGGC GGCACACGTT ATTCACGTCG GTAGTGCAGC TACCGGCCAA TTCCGGTGGA AAATGTACGT CGCCTTGTGG CGGGCACGGG CGCGGTACAC GGCCAAGTTC GGGGGCCGAG GTCTACGCTA CGCACATACC GCCCTCGTCA CTCTCGGTAT GCTCCGCCTG ATCGGCAAGG CGTGGCGTGA TTACTTCAGT GGACGGATCG ACCGGGACTC CCTACGCGGG CAACTCCTCG CCTACGGATT AATTTTGCGC ACAACCGGCA ATTTGGCTGC TCAACCGGTT GCGAAGGCGA CAGGCTGA
|
Protein sequence | MEHTLSPIGV VVVSYNTAPL LRRCLASLQA CTLPLRIVVV DNGSQDESVA LVRREFPTVT VRERPDNPGY AAACNEGISL LVDTCGAILV LNPDTELLPG AIEAMADFLA AHPRVGLVGP RLLNPDHTLQ RAAFRFPDLI TTALDLFPPG EVLPGRLYDS WWHGRYPAEL GDAPFPIDYP LGACMMVRSA TIGEVGLMDE DYFMYCEEID WCRRIKQAGW AIWQVPAAHV IHVGSAATGQ FRWKMYVALW RARARYTAKF GGRGLRYAHT ALVTLGMLRL IGKAWRDYFS GRIDRDSLRG QLLAYGLILR TTGNLAAQPV AKATG
|
| |