Gene Information |
Locus tag | Cagg_3592 |
Symbol | |
ID | 7269736 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4369956 |
End bp | 4370981 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643568400 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002464866 |
Protein GI | 219850433 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
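The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is what the listed gene length implies) and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; names are illustrative only.
start_bp = 4369956
end_bp = 4370981

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under those assumptions the computed values agree with the listed gene length of 1026 bp and protein length of 341 aa.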
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000663913 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTGATG TACAACCAAC CTTCTCGGTC GTCGCGCCGG TGTACAACGA AGAGCAGTTG ATCGCCGAGT TTTGTCGGCG GGTTATTGCG GTACTTGAAC CGCTAGGGGA ACCGTTTGAG CTGGTGTTAG TGAACGATGG CTGTCGCGAC CGCTCACCGG AGATTATGCG CGAGCTGCAC GAGCGTGACC CGCGAATTAA GGTGATCAAT TTCTCGCGCA ATTTTGGTCA TCAGATCGCG ATTACTGCCG GTACCGACTA CGCGACCGGT AAAGCAGTGA TTGTGATCGA TTCGGATTTG CAAGACCCGC CTGAGGTGAT CCCCGCGCTG ATTGCGCGTT GGCGTGAAGG GTATCAGGTC GTCTATGGTG TGCGCGAAGA GCGTGAAGGT GAGACGTGGT TTAAAAAGAC AACGGCGTCC ATCTTCTATC GCTTGATCGT GCGGATTACC AATGTCAACA TCCCGGTCGA CACCGGTGAT TTTCGTCTGA TGGATCGCAA AGTTGTCGAC GCTCTCAAGC GTATGCGCGA ACATCATCGC TTTATGCGTG GGTTGTCGGC GTGGGTCGGT TTTCGTCAGA CCGGGGTGCC ATATCGTCGC CATGCCCGTG CTGCCGGTAC CACCAAATAC CCGTTACGCA AGATGTTGCG TTTTGCCCTC GATGGCATTA CCAGCTTCTC GTATTTGCCG CTGCAATTGG CAACCTATCT CGGTTTTGTG GTCGCCGCAA TTAGTATGAT CTTCCTGCTG GTTGTGTTTG TTATGCGGCT AGCGAACCCC GCGGCTGCCG AACCGGCGTT TTATGGGCAA GCCAGTACGC TGGCAAGCGT GCTCTTCCTC GGCGCAGTGC AACTGATTTC GCTCGGCATC ATCGGCGAGT ATGTCGGTCG TATTTACGAT GAGGTGAAAG GCCGGCCACT CTATATCGTC GCTGAAACGT TGGGTATCGC CGAGCCGGAT GCAACTTCTG CCGCGATGGT ACGTACTTCA TCTACAGAGC ATGAGGTAAC AACGTCATCG GGGTAA
|
Protein sequence | MSDVQPTFSV VAPVYNEEQL IAEFCRRVIA VLEPLGEPFE LVLVNDGCRD RSPEIMRELH ERDPRIKVIN FSRNFGHQIA ITAGTDYATG KAVIVIDSDL QDPPEVIPAL IARWREGYQV VYGVREEREG ETWFKKTTAS IFYRLIVRIT NVNIPVDTGD FRLMDRKVVD ALKRMREHHR FMRGLSAWVG FRQTGVPYRR HARAAGTTKY PLRKMLRFAL DGITSFSYLP LQLATYLGFV VAAISMIFLL VVFVMRLANP AAAEPAFYGQ ASTLASVLFL GAVQLISLGI IGEYVGRIYD EVKGRPLYIV AETLGIAEPD ATSAAMVRTS STEHEVTTSS G
|
| |
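The sequence fields are printed in space-separated blocks of ten, so they need normalizing before any computation. The following is a minimal sketch of how the GC content and basic coding-sequence shape could be checked. The helper names (`normalize`, `gc_percent`, `looks_like_cds`) are hypothetical, and the start-codon set is a deliberately reduced subset of what translation table 11 permits.

```python
# Assumption: a reduced set of common bacterial start codons; translation
# table 11 allows a few additional alternative starts not listed here.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def looks_like_cds(seq: str) -> bool:
    """Check length divisibility by three plus start and stop codons."""
    s = normalize(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in START_CODONS
        and s[-3:] in STOP_CODONS
    )


# Toy input: the first nine bases and the last six bases of the gene
# sequence joined together, not the full record.
toy = "ATGTCTGAT GGGTAA"
print(gc_percent(toy), looks_like_cds(toy))
```

Passing the full gene sequence string from the record to `gc_percent` should give a value that rounds to the listed GC content, and `looks_like_cds` only tests the outer shape of the sequence, not internal stop codons.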