Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0338 |
Symbol | |
ID | 7268439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 420647 |
End bp | 421816 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643565206 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002461720 |
Protein GI | 219847287 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000810551 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.021176 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAAC AGATCGTCTT CGTGCTCGCA GGGTTACTCA GTTTCATCTT AGCCGCGTGT GGCGGTGGTG GTCAGCAGGC CCCCACCACG GCACCCGCCC AACCGGCTAC AACCGCCCCT GAGCAGCCAA CTACCGCGCC GGCTGCTCCG GCACCGACAG CAGCTCCGGC TGCAACGTCG GCGCCGGCCG GTGGTACCAC CAACCAGCCG GCAAGTGGTG GTCTGATCCA GCAGATTTTG GCCCGTGGCC GGTTGATCTG CGGTGTCAAC AACAACCCGC TACCCGGTTT TGCCTCGGTT GATGCGTCCG GCGTCTATTC CGGCTTTGAC ATCGACTTCT GTCGGGCAGT GGCGGCTGCG CTGTTTGATG ACCCCACGAA AGTTGACTTC CGTCCGCTCA GCGCTCAAGA GCGATTCACC GCTCTGCAAA CCGGTGAAAT CGATGTGCTG ATCCGCAATA GCACGTGGAC GCTGGGCCGC GATGGTAACC TGGGTCTCGA TTGGGCACCG ACGACTTTCT ACGACGGCCA AGGCATGATG GTACGCAAGG ACAGCGGAAT CAACACGCTG GAAGATATGG ACGGCGCAAC CATCTGTGTG CAGACCGGTA CCACGACCGA GTTGAACCTG GCCGACCAAT TCCGCGCCCG TGGTCTGACC TTCACGCCGG TCGTCTTCCC CGACGGTGAC TCGACACGCG CCGCCTACGA CGCCGGTCAG TGTGATGGCT TCACCACCGA CAAATCGGGG TTGATCTCGA GCTTAACCCT GCTCTCCAAC CCGGCCGACC ACAAGATTCT CGAAGTCACG ATGTCGAAAG AGCCACTTGG GCCGGCAGTT AAGCAGGGTG ATCCGCAATG GTTTGATGCA GTACGCTGGA TTGTCTTCGC TACCTTCCAA GCCGAAGAGT ACGGAATTAC GTCGCAGAAC GTGAACGATT TCTTGAACAG TGACGTTCCT GAGATCCGCC GCTTCCTCGG CATCGAAGGT GATCTGGCGG CCGGTATCGG GTTGCCCAAT GATTTCGCAG TACGCATCAT CCGTCACGTC GGTAACTATG CGGAGATTTA TAACCGCAAC CTTGGCCCCG ATACCCCCTT CAACTTGCCT CGTGGTTTGA ATGCGCTGTA TACGGACGGT GGTCTGCTCT ACTCACCACC CTTCCGCTAA
|
Protein sequence | MRKQIVFVLA GLLSFILAAC GGGGQQAPTT APAQPATTAP EQPTTAPAAP APTAAPAATS APAGGTTNQP ASGGLIQQIL ARGRLICGVN NNPLPGFASV DASGVYSGFD IDFCRAVAAA LFDDPTKVDF RPLSAQERFT ALQTGEIDVL IRNSTWTLGR DGNLGLDWAP TTFYDGQGMM VRKDSGINTL EDMDGATICV QTGTTTELNL ADQFRARGLT FTPVVFPDGD STRAAYDAGQ CDGFTTDKSG LISSLTLLSN PADHKILEVT MSKEPLGPAV KQGDPQWFDA VRWIVFATFQ AEEYGITSQN VNDFLNSDVP EIRRFLGIEG DLAAGIGLPN DFAVRIIRHV GNYAEIYNRN LGPDTPFNLP RGLNALYTDG GLLYSPPFR
|
| |