Gene Information |
Locus tag | Clim_0722 |
Symbol | |
ID | 6356003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 792773 |
End bp | 793738 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642668348 |
Product | protein of unknown function DUF1568 |
Protein accession | YP_001942783 |
Protein GI | 189346254 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1943] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
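The coordinate, gene-length, and protein-length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    return gene_len, codons - 1  # drop the stop codon


gene_len, protein_len = cds_lengths(792773, 793738)
print(gene_len, protein_len)  # 966 321
```

Under these assumptions the result agrees with the Gene Length (966 bp) and Protein Length (321 aa) rows.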
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.645218 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCGAG GGCCGAGACT TGACGCTCCG GGAACCCTGC ATCATGTGAT TATTCGGGGG ATAGAGCGGG GCAGCATTGT TCGTGACGAT ACTGATCGGA AAACGTTTCT CGACCGCATG GGACTGCAGG CCAGGGGCTC GGGTACCAGC ATCTACGCAT TCGCCCTCAT GACGAACCAT GCGCATATTC TTCTGAAAAG CGGTCCGGCT GGCATCTCCA CCTTCATGCG CCGCCTGCTG ACCGGTTATG CCCAGTATTT CAACCGCCGA CATACGCGGG TCGGGCACCT GTTCCAGAAT CGCTACAAAT CCATTATCTG CGAAGAGGAA GCGTACTTCG ACAAGCTGGT GGCGTACATC CATCTCAATC CGTTGCGGGC AGGACTGGTC GAATCGTTCG AAGAGCTTGC ACGGTATCCT TGGTGCGGCC ATGCCGTGCT GCTGAACAGG GTCCGGTACG ACTGGATGGA TCGGGACTAC GTGCTGCTGT TTTTCGGTCA GAAGGAGGGG CCGGCACGGA AGGCGTATCT GCAGTTTCTC GAAGAGGAAC TCGGTGTCGA TCGGGAAAAG GAGTTGTCGG GAGGCGGATT CGTGCGATCG CAGGGCGGAT GGTCGAAGGT GCAGTCCATG CGCAGGCGGG GCGAGAAAGC TCTTGGCGAC GAGCGGATTC TCGGCGGAGA CGAGTTCGTG AAGGAGGTGC TGAAGGAGGC TGAAGAGCGC AGGGATGTAC TGCTGCCGGA ACACGAGCGG CTGCACTTGC TCGATGATGA TATCGAGCGT GCGTGCAGGG ATGCCGGCGT CACCCCGGTT TTCCTCCGCT CCGGGAGCAA GGCTGGCAAG CTGCCGTCCC TGAGAAAAGA GCTTGCAAGG AAAGCCCTGC TGGAGTACGG CATGTCGCTT GCTGAAACAG TAAGACAGCT CGGTGTGACT GCCAATGCCG TGAGCTATAT GCTCAAGCCG CAATGA
|
Protein sequence | MPRGPRLDAP GTLHHVIIRG IERGSIVRDD TDRKTFLDRM GLQARGSGTS IYAFALMTNH AHILLKSGPA GISTFMRRLL TGYAQYFNRR HTRVGHLFQN RYKSIICEEE AYFDKLVAYI HLNPLRAGLV ESFEELARYP WCGHAVLLNR VRYDWMDRDY VLLFFGQKEG PARKAYLQFL EEELGVDREK ELSGGGFVRS QGGWSKVQSM RRRGEKALGD ERILGGDEFV KEVLKEAEER RDVLLPEHER LHLLDDDIER ACRDAGVTPV FLRSGSKAGK LPSLRKELAR KALLEYGMSL AETVRQLGVT ANAVSYMLKP Q
|
| |
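The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It builds the codon assignments of NCBI translation table 11, which match the standard code (table 11 differs only in its permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


prefix = "ATGCCGCGAG GGCCGAGACT TGACGCTCCG"  # first 30 bases of the gene sequence
print(translate(prefix))    # MPRGPRLDAP
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content row; the figure printed for the prefix is not representative of the whole gene.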