Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0631 |
Symbol | |
ID | 7266103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 776147 |
End bp | 777910 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643565492 |
Product | LPXTG-motif cell wall anchor domain protein |
Protein accession | YP_002462004 |
Protein GI | 219847571 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAAGA TCTCACGTCG GCGGTTTCTC AAAGGGACGG TTGCTCTCGG TGCTGGCGCA TTGCTTGCGA TCTACAGTGA TGGTAGCTTC CGGCTAGCAT TGGCACAGGA AAACCCTGCG TTCCGCATGC GGATTCTGCA CACCAATGAC CACCATGCCC GGATTGAGCC TGTGTTCAGC GGTAACAATC CGGTTCACGG CGGTGTCTCG CGCCGTAAAG CGTTAATTGA CAAGATCCGT CGCGAGACGG CACTGCCGAC CTTACTGGTT GATGCCGGTG ATGTATTTCA AGGGACGCTC TACTTTAACC AATACAACGG CATGGCCGAC CTCGAGTTCT ATAACGCAAT GGGCTATGAG GCGATGGCCG TCGGTAATCA CGAATTTGAC AAAGGGCCGC AGGCATTAGT CGATTTTATT ACGCGTGCCA AATTCCCGGT GTTAAGTGCT AACATCTCGG TTGCCGCCGG CAACCCACTG GCCGGTCTGA TCAAGCCGCG CACCATCATT GAGAAAGATG GTAAGAAGAT TGGGATTTTC AGCCTTACGC CTGAAGATAC CGGTGTGCTG TCGAATGCCG GCCCCGGCAT TAGCTTCACA TCGGCGATTG AAGCGGCACG GCAGCAGGTT GCCGCGCTGA AGGCGGAAGG TGTCTTCACG ATCATCGCTC TGACCCACGT CGGGATTAAT GTTGATCGCC AGATTGCACG CGAAGTTGGT GGAATGAGTC TGATTATTGG CGGCCACTCA CACACGCCGA TGGCACCGAT GAACAATGTG CGCACGCCGC CGTACCCCGA ACTCATCGCC GGGCCGGATG GCAAGCCGGT GGTGGTCGTT ACCGATTGGG AGTGGGGGCG CTGGCTAGGT GACATCACCG TAGCCTTTAA TGCCGCCGGC ACGGTAATCG ACTTACAGGG CAACCCGACT GAGGTGCTGC CGTCGTTGCC GGCGGATCAG GGGTTCGAGA ACCGGATTGC GGTCTTCAAG GGGCCAATCG AGCAGTTGCG TGCGCGGGTG GTTGGTTCGG CAGCGGTCGA TCTCGATGGC AGCCGGACCA ACATCCGCTC ACGCGAGACC AATCTCGGCA ATCTCGTGGC CGAGGCGATG CTGGCGAAGG CGCGTAATTC AGGAGCCACT ATCGCCATTA CCAACGGTGG CGGTATTCGA GCGTCGATCC CTGCCGGTCC GGTAACTGTC GGCCAGATTT TAGAGGTCTT GCCGTTCGGT AATACGCTGG CGCTCGTAAC ACTCACCGGG TCACAGGTCA TCGAAGCGCT TAACAATGGT GTAAGCCAGG TTGAGAGCGG TGCCGGTCGG TTCCCGCAAG TGGCCGGGCT ACGCTTCACC TACGATCCGT CACTGCCAGC AGCCAGCCGG GTGACGAGTG TGACCGTCGG TGGCGCGCCG ATCGATCAAA ACGCCAGCTA CGTCGTCGTC ACCAACAACT TTATGCTGAC CGGCGGCGAC GGCTACAGCG TCTTTATCCG CGGGCGCAAT CAGGTTGACA CCGGCTTCAT TCTGGCCGAC GTGGTAGAGG AATACATCGC CGCCAATTCA CCGGTCAATC CGGCGGTCGA TGGGCGCATT GCTATCGGTG CAGCACCGGC AACGACACCG GCGCAACCGG AGACGCCGGC GCAGCCGGTG CCGGCAACGT TGCCTAACAC GGGTGGCGCG CTGACGCCAC TGGCGTGGCT GGCCGGGTTG GGTGCGGCGG CGCTGGCCGG TGGTGCCGCG TTGCAGCGTA GTGAGAAGGA GTAA
|
Protein sequence | MEKISRRRFL KGTVALGAGA LLAIYSDGSF RLALAQENPA FRMRILHTND HHARIEPVFS GNNPVHGGVS RRKALIDKIR RETALPTLLV DAGDVFQGTL YFNQYNGMAD LEFYNAMGYE AMAVGNHEFD KGPQALVDFI TRAKFPVLSA NISVAAGNPL AGLIKPRTII EKDGKKIGIF SLTPEDTGVL SNAGPGISFT SAIEAARQQV AALKAEGVFT IIALTHVGIN VDRQIAREVG GMSLIIGGHS HTPMAPMNNV RTPPYPELIA GPDGKPVVVV TDWEWGRWLG DITVAFNAAG TVIDLQGNPT EVLPSLPADQ GFENRIAVFK GPIEQLRARV VGSAAVDLDG SRTNIRSRET NLGNLVAEAM LAKARNSGAT IAITNGGGIR ASIPAGPVTV GQILEVLPFG NTLALVTLTG SQVIEALNNG VSQVESGAGR FPQVAGLRFT YDPSLPAASR VTSVTVGGAP IDQNASYVVV TNNFMLTGGD GYSVFIRGRN QVDTGFILAD VVEEYIAANS PVNPAVDGRI AIGAAPATTP AQPETPAQPV PATLPNTGGA LTPLAWLAGL GAAALAGGAA LQRSEKE
|
| |