Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1919 |
Symbol | |
ID | 7266410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2352090 |
End bp | 2353319 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643566756 |
Product | CBS domain containing protein |
Protein accession | YP_002463250 |
Protein GI | 219848817 |
COG category | [S] Function unknown |
COG ID | [COG1993] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0042523 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGAAC CAACGGTGCA ACGGGTACGC ATCTATTTAA ATGGTGAAGA TAGCGCAGAC GGTCAACCGC TCTATAAAGT GGTAGTAGAC GAATTGCGTC AGAGTGGTGC GACCGGTGTG ACGGTTCTGC AGGCTTTGAC CGGCTTCGGA CCACGTCGGC AGATGTTACC CAACGCGATG CGGCAGCCGG TCGTAATCGA GTGGGTTGAC AATGCCGTTC GTATCCAACG ATTACTGCCG TTGCTGAACC GCTTGATCGG TGATGCGCTG GTAACGATCG AGCCGGTAGC CATTGTGCAG GGTGTACTGC GTCCGGCCGG TCCGTTCAGT GCCGCCCAAT TAGTCTCCGA TCTGATGCAA GCAGACGCAC CGGTGATTGA CGCCACTGCA CCACTCCTCG ATGTCCTCGA ACCGTTTATC ACCGGTCGGG TGGAAGTATT GGCAGTAGTT GAGAATGATA CCGTCATCGG CACTATCTCG CTGCGTGAAT TAGTGTGGCG TGCCGGTTTA CGGGTACCAC CTTACCTGCT GAGTATGCTT GAGCCGGCCG AACGAGCGGC AGTGCTGGCA CCGCTACAGG CATTAACTGC CGGTGCGATT ACTAATCGTG AGATACGTGG TGTGCATACA ACGATGCCGA TTACGCAAGC CCTGACCCGT ATGATCGAGT GGGGCTACAA CCAAATACCG GTGCTCGATC CGCTCGGGCG GTTGGCCGGG GTGTTCGGGC AACACGAGGT GTTGCAGGCA GTAGCGCACC GGTCTGAATC GGAGGAAGCT ACCGGTTTGG AACTCCAGGT GGGAATGGTG ATGCAAGCGG CAACAGCACG GGCTACGCTT GGTCAATCAT TGGCGACAGC ACTTGCCTTG CTGATCACAA CTCCGGGTCA AAGTTTGTTT GTCGTTGATG GCGATAGGCG TGTTGTGGGT GTCTTGCGGT TAAGTAAGGT ACTATCCAAT TTGCAAGACG ACGAGCGAAC GAGCTTACTG ACTGCATTGC AAAGCACTCA ACGAGTCCAG CCGACAGCGT TGCCTGGGGC ACGTCGCACG ATCGACGCCT TTCTCGAACC ACCACCACCG GTATTGGCGA TCAATACCTC GCTCGGAGTT GCCGCTCGTC AGTTACTCAC AATGAACACG GAGCGGTTGC CGGTGGTAGA CTCAGAAGGG CGACTGTCGG GTATAATCGC ACGTGGTGCC CTCGTGCGGG CATTACTACA ACACGAATAG
|
Protein sequence | MAEPTVQRVR IYLNGEDSAD GQPLYKVVVD ELRQSGATGV TVLQALTGFG PRRQMLPNAM RQPVVIEWVD NAVRIQRLLP LLNRLIGDAL VTIEPVAIVQ GVLRPAGPFS AAQLVSDLMQ ADAPVIDATA PLLDVLEPFI TGRVEVLAVV ENDTVIGTIS LRELVWRAGL RVPPYLLSML EPAERAAVLA PLQALTAGAI TNREIRGVHT TMPITQALTR MIEWGYNQIP VLDPLGRLAG VFGQHEVLQA VAHRSESEEA TGLELQVGMV MQAATARATL GQSLATALAL LITTPGQSLF VVDGDRRVVG VLRLSKVLSN LQDDERTSLL TALQSTQRVQ PTALPGARRT IDAFLEPPPP VLAINTSLGV AARQLLTMNT ERLPVVDSEG RLSGIIARGA LVRALLQHE
|
| |