Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_2079 |
Symbol | |
ID | 6355057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 2293451 |
End bp | 2294704 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642669675 |
Product | domain of unknown function DUF1745 |
Protein accession | YP_001944087 |
Protein GI | 189347558 |
COG category | [S] Function unknown |
COG ID | [COG3287] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00286991 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAA TTTCAAAAAT ACAAGCCGGC TCAGGAGTGA GCGAGCTGAC GGATTCTTAT GCGGCAGGCG TTAGCGCCGC TTCCGAAGCT CTTACGGGGA TTTCAGGCAG AACACCTCAG GTACTGATCG TATTCGGTGC GATGCGTTTC GATCATCGTG AACTGCTCAA AGGTATAATG TCCGTAGCTG GTGATATTCC TCTGGTTGGG GGAACAACTG CCGGCGAGAT ATCGACCGGC GGTTTTTCGA CCGGCTCGGT CGTGGTTATG GCGTTAGCTT CGGATTATCT GCATTGTGTT GAGGGCATAG GCCACGACAT GAGTAGTGAC GAAGCTGCAT GCGCCGTTGA AATGGCCACA GATATCCTCT CAAAGGGTTC ATTCGACAAG GACGCATCCC TCATGGTTTT TCCCAATGGC ATGGGAGGAG ACGGCCTTCG TGTTCTTGAC GGTCTTCATT CCGTTTTAGG CCGGGGTTTT GAAATTTCAG GCGGGTTTCT GGGAGATGAC GAGCGCTTTC AGAGCACCTT TCAGTACTAT AACGGCAAGG TATACAAAGA TGCCATTGTC GGACTGATGA TCTCCAGAAA GGATTGCATC AGGACAGGTA TCGGAGTCGG GAGCGGTTTC GAGTCGATCG GTAACAGTTT CGTTTGTACC GCCTCGGAAG GTAATCTTGT CACGGAGTTT GACCATGTTC GGGCTCTTGA TCTTTACAAG GATTTTCTTG GTGAAGAACG TTCTGCACGT CTTCCAGGGG TTTGCCTTGA ATATCCGTTC GGGCTTATCG ATCCGGCGCT GTCCACAGGG GGGGAAGAGC TGTTCCAGTT GCGTTGCGGC CTTTCGGTCG ATCATGCAAA AGGCACGATC TCTCTTGCTG CTTCGATTCC ATCCGGAAGT TCAGTAACGC TGACAACTGC CTCAAGGGGA GATATCATAC AAGGGGCGCG TTTTGCTGCC GAACAGGCAA AGGCATGTCT GTCAGGGGCT ATTCCGCGAC TTGTCGTCAT GTTCAGTTGC GTGGGGCGCA AGCTTGTTCT CGGAAGACGT ATACAGGAGG AGGCGGCCAC CATTAAGGAG TGCCTCGGCA GCGATGTTCC GTTGATCGGA TTCTACACGT ATGGAGAAAT AGGCCCGGTC AACAAAATGA AGCCAGGCTT TGAGACAGCT AAATTTCATA ACGAAACGGT TGTGCTCTGG GTGATCGGCG AAGAGCATTC CGGATCGCCT GCGACAGCCG TTGTGCGGCC TTGA
|
Protein sequence | MNAISKIQAG SGVSELTDSY AAGVSAASEA LTGISGRTPQ VLIVFGAMRF DHRELLKGIM SVAGDIPLVG GTTAGEISTG GFSTGSVVVM ALASDYLHCV EGIGHDMSSD EAACAVEMAT DILSKGSFDK DASLMVFPNG MGGDGLRVLD GLHSVLGRGF EISGGFLGDD ERFQSTFQYY NGKVYKDAIV GLMISRKDCI RTGIGVGSGF ESIGNSFVCT ASEGNLVTEF DHVRALDLYK DFLGEERSAR LPGVCLEYPF GLIDPALSTG GEELFQLRCG LSVDHAKGTI SLAASIPSGS SVTLTTASRG DIIQGARFAA EQAKACLSGA IPRLVVMFSC VGRKLVLGRR IQEEAATIKE CLGSDVPLIG FYTYGEIGPV NKMKPGFETA KFHNETVVLW VIGEEHSGSP ATAVVRP
|
| |