Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2082 |
Symbol | |
ID | 7267589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2546589 |
End bp | 2548571 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643566917 |
Product | NHL repeat containing protein |
Protein accession | YP_002463406 |
Protein GI | 219848973 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCTAC GTTTGCTGAT CGCCTTACTC CTTACGACAC TGATCAACGC TTGTGGGACA CCACCGCTTC CAACACCGGA ACCGGTCGCG CTTGGTCCGA CGGCAGTGAT ATTGAACGAA GCGACCACGT TTAGCGAATT GAACGTGCGG TTACGCTTAC CTGCCGGCTG GCAGAGCCGC ATCGAGAGTG GGATGTTGCG ACTCGCTCCC AACATGACAA CCCTCGAAGC CGATGTCATC AATGAGCCGA TGATCCTCCT TGATACCACA TCACTTACCA CCTTGACCAC GCAATACGGT TCGTCGGCCG CTAACCCGGA AACCATTTTC GAGCTGGCGA GTGGTGCGAT CCAATCGGCC GGTTATACCA TAGCACCTAC CAAACCGATA CACCTTGGCA ACGCACATGG CGTAGTTGCC GACATTACCG GACCGACCAG TACCGGTCGA TTACTCGTCC TCATTGACGA GACCCGTGCA GTACGCATTT TGGTACAGGC GGCAAACGAC CAGTGGGTAC GATCACAAGC GCTGATCGAC AGCATACTGG CAACCATCGA ACTGCTCCCC GTACCATCCC CTACACCTAC CCCGACCAAT CTTGCCGCCC AACCACAAAT TGTGCGCTCT GGACCACCGG GCTTTGTGAT GCGGATCGGT GGGCGGAGTG GCCCGGCCAA CAGCCGCTTC ATTGCCGCCC GCGGCTTAGC CGCCGCACCC GATGGGACGA TCTACTTGGC CGAAAGCGGA CGTGGGGTCT GGGTCTTTGC CCCCGACGGG ACATTACGCC AGACGTTCGG CGCCGATGAG CTACTCGACG CCTACGACGT AGCCCTCGGC CCTACAGGCG ACATCTACGT CGCCGATTAT GGTCGTAACG CTATCGTCCG TTTTAGCAGC GATGGCACCT TCCTCAGTCG ATGGGGCGGC CATGGCGACG CACCTGACCA ATTTGGGCTT TCAGCACCCC AACGGATTGC AGTGGGGAAT GACGGCAGTG TCTACGCGCT CGATACTCGT CCTGGTGCGG ATGGGCTAGC CGCAAGTAGT ATTGTGCGTT TCAGTGGTGA AGGGCGCTTC CTTGAACGGA TCGAACTACC ACCCGATTTA GCGCCGGCCG ATTTAGTCGT CGACCCCGGT GGTGTCATCT ATCTGGCCGA GAATTTTGCC GGCGTGATCG TTAAGCTTGC CCCCGACGGT ACAGTTATCG CCCGCTTGGG CGATCCGGCC GATCCTACGC AATTCGCCGG ACCGGTACTC GATCTTGATC GGGCCGGTTA TCTCTATCTT GCCACCTATA CCGGCATCAT CTTACGACTG GCGCCCGACG GAACGATTGT CGCACGCGGG GGTAGTCCGG CTACCCCCGG CAGCCTGCCG AACCCCGGAG AGATCAGTCT GCCCAACGGA ATTGTGGCTG CACCCGGTGG TGTTGTATGG GTGAGCGACA ACAGTGGTGA GTACAGCGCA ATCTCGGCAT TTCGGCTCCA AACCGACGCC GCGGCCCTAG CCACGGCAAT GGCACTCACG CCTACCGCCC TCACAGTGGT CGAAACAGCG CAGCAGTGGG CGGTTGCGGC TACCGCCAGC AGCTTCTACG CTCCCGACTA CGATCCTGAC GGCGTCATCG GCCCACCCAA CGTACCTGGC TGCCAAGACA GTCCTGACGC TTGGGCGCCG GCCATCCCCG GCAGCCGTGA AACCCTCACC GTCACCTTTG CCGAGCCAAT GTTTGCCAGT GCTCTGACCA TTTATCAAAA CCACCAACCC GGATACATCA CGCATGTCGA ACTTATTGAT GAGCAGGGCA CTGTGCGAAC AGTCTACCGC GCCGACCCCA CCCCTGCGCC AGAGTGTCCG TTTGTCACCA CGATCACCTT CGAGCAAACA CTCACACGTA TTGTTAAGGC GCAAATCACG CTTAATCAGC GGGATGGCAG TTGGAGCGAG ATCGATGCGG TGGCCTTAAT CGGCATACCC TAA
|
Protein sequence | MRLRLLIALL LTTLINACGT PPLPTPEPVA LGPTAVILNE ATTFSELNVR LRLPAGWQSR IESGMLRLAP NMTTLEADVI NEPMILLDTT SLTTLTTQYG SSAANPETIF ELASGAIQSA GYTIAPTKPI HLGNAHGVVA DITGPTSTGR LLVLIDETRA VRILVQAAND QWVRSQALID SILATIELLP VPSPTPTPTN LAAQPQIVRS GPPGFVMRIG GRSGPANSRF IAARGLAAAP DGTIYLAESG RGVWVFAPDG TLRQTFGADE LLDAYDVALG PTGDIYVADY GRNAIVRFSS DGTFLSRWGG HGDAPDQFGL SAPQRIAVGN DGSVYALDTR PGADGLAASS IVRFSGEGRF LERIELPPDL APADLVVDPG GVIYLAENFA GVIVKLAPDG TVIARLGDPA DPTQFAGPVL DLDRAGYLYL ATYTGIILRL APDGTIVARG GSPATPGSLP NPGEISLPNG IVAAPGGVVW VSDNSGEYSA ISAFRLQTDA AALATAMALT PTALTVVETA QQWAVAATAS SFYAPDYDPD GVIGPPNVPG CQDSPDAWAP AIPGSRETLT VTFAEPMFAS ALTIYQNHQP GYITHVELID EQGTVRTVYR ADPTPAPECP FVTTITFEQT LTRIVKAQIT LNQRDGSWSE IDAVALIGIP
|
| |