Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3830 |
Symbol | |
ID | 7266310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 4667588 |
End bp | 4669192 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643568641 |
Product | hypothetical protein |
Protein accession | YP_002465101 |
Protein GI | 219850668 |
COG category | [R] General function prediction only |
COG ID | [COG1106] Predicted ATPases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.531313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000259973 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGAGC GTATCGTTAT TCATCGCTTC CGTGGCATTC GCCAGGGCGA TCTGAACCAT CTGCGGAAAT TCAATCTGTT TATCGGACCA AACAACAGCG GCAAGACCGC CATCCTCGAA CTGCTCTACC TCAGCGCGAC GAGTGGGCGA CCGGTTCAGT TCATCCGTGA CGATCTGCTG CCTGCCGAGA CCGGTGTGCT CAGGGCGACC ACCTCGGCGC GCACCGATCT GCTGGGCTAC GAGCCGCTGC CATACCTGCG CCAACGTCAT GGCAAGCACG GCGAGTGGGC CGGCAATCCG GCGGTGGTGA CACCAGAAGG CGGGTTGGAG ATCAACTTGC GCCGTCTGCC GAATAGCGAT GGCGCACCTC CGTGGAACTC CTTTCGGCTG GCCGCACCGC TGCCGGACTG GGGCGAGCCG GATGTGTACG CTTTTCGCAA GGAAGATATT GCCCGCATTG CGATGTTTAC CCTGCCACAG CCAACGACGC TCGATCCCAG CATGATTCCA CCCGCGATTG CCGCAGCCGG GATCATCCCG ACCGGCGCAG CCACCGACAC GACCACCGCC GCACCGACAC CAACAACCGA TACAGCGACC GAAGCAGAGG AGTTGGGCAG CGCAGCCACC GACACGACCA CCGCCGCACC GACACCAACA ACCGATACAG CGACCGAAGC AGAGGAGTTG GGCAGCGCAG CCACCGACAC GACCACCGCC GCACCGACAC CAACAACCGA TACAGCGACC GAAGCAGAGG AGTTGGGCAG CGCAGCCACC GACACGACCA CCGCCGCACC GACACCAACA GACACGACGC CGATCTACGA TTGGCACTAC CTCTGGGAAC CGGACTGGGT GTACCGTTGG GATCGGCAGC AACCCATTGA TCGCCTGGCG GTCTGGGTCA CGCAAGGACG GCGACCGCAG CCGCAGCAGG TCGTGTTCTT TTCTTCGCAG ACGGCGAATA GCCATTTCAC CGACCACTTT GCCAAGTGGG CCTATCACCA TGTCAAGGAC TGGCACGAAA CGCTTGCCGG GTTGATGGCG CAGGTGTTTC CGGCACTGGA GGGGGCCAAG ATTGAGGTGC TTGACGCGCC TGACGACCAA CCGGGCCGAA CCGGCTATGT GCGCTTTCCG AACCGAACGC CGCTGGCCAT CGATCAGTTC GGTGACGGCG CCCGTCATGC GTTCAAGTTG CTGGCTGCCC TCACCGCCTT AGCCGCGACG GTTGATGACG ATCATCCCGG CTTGCTCTTG TGGGAGGAGC CAGAGGTGTA TATGCACGCG GCAACCCTCA ACCGTCTGTT ACGCATCGTA GCCGATATTG TTGCTCAAAA ACCAATTCAG GTATGCATTA CCACTCAGAG TCTGGAAGTT CTGGCGTGGC TGATTCTCTA TCTTGATCAA CAATCGGCTA TGCAACCGGA TCAGATCAGC ACGTTTCATC TCAACCTGAA GGATGGACGG TTGCATGTGC GTCCATTTAT TGGCAAAGCG CTCGGCGGAT GGTTCGATTT CTTTGGTGAT CCGCGCCTGA TTGAAGAAGA CGAACTGGCT TCACCACTGA CACGCCTGTT GAGCATTCGG GAGGAACGTG AATGA
|
Protein sequence | MIERIVIHRF RGIRQGDLNH LRKFNLFIGP NNSGKTAILE LLYLSATSGR PVQFIRDDLL PAETGVLRAT TSARTDLLGY EPLPYLRQRH GKHGEWAGNP AVVTPEGGLE INLRRLPNSD GAPPWNSFRL AAPLPDWGEP DVYAFRKEDI ARIAMFTLPQ PTTLDPSMIP PAIAAAGIIP TGAATDTTTA APTPTTDTAT EAEELGSAAT DTTTAAPTPT TDTATEAEEL GSAATDTTTA APTPTTDTAT EAEELGSAAT DTTTAAPTPT DTTPIYDWHY LWEPDWVYRW DRQQPIDRLA VWVTQGRRPQ PQQVVFFSSQ TANSHFTDHF AKWAYHHVKD WHETLAGLMA QVFPALEGAK IEVLDAPDDQ PGRTGYVRFP NRTPLAIDQF GDGARHAFKL LAALTALAAT VDDDHPGLLL WEEPEVYMHA ATLNRLLRIV ADIVAQKPIQ VCITTQSLEV LAWLILYLDQ QSAMQPDQIS TFHLNLKDGR LHVRPFIGKA LGGWFDFFGD PRLIEEDELA SPLTRLLSIR EERE
|
| |