Gene Information |
Locus tag | Cag_1092 |
Symbol | |
ID | 3747959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1475323 |
End bp | 1478955 |
Gene Length | 3633 bp |
Protein Length | 1210 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637773623 |
Product | hypothetical protein |
Protein accession | YP_379397 |
Protein GI | 78189059 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
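The coordinate, gene length, and protein length fields above are related by simple arithmetic, and the following minimal Python sketch checks that they agree. It assumes, without confirmation from the record itself, that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table for Cag_1092.
START_BP = 1475323
END_BP = 1478955
GENE_LENGTH_BP = 3633
PROTEIN_LENGTH_AA = 1210


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the span ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, the listed 3633 bp span corresponds to 1211 codons, of which the last is the stop codon, leaving the 1210 residues reported as the protein length.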
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.69355 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTC GTCGTTTTGA ATTGAACGCT TTCGGACCTT TTAGCGGCAA TGTGCTCGAT TTTAACTCAC CAACCCCCGG CTTGCACATT GTGTATGGAG CCAACGAAGC GGGCAAAAGC AGCGCGATGC GGGCGCTTTA CGCATGGTTT TTTGGCTATC CACTCCGCAC TACAGACGAT TTTCTCCATA AAAAAAGCAA CCTTCTCCTT AGTGGAACGC TTGAAAATAA GCAAGGTGAA GTGCTCACTT TTTCGCGCCG CAAACGCAAA GAGCGAGATT TGTTTGATGG AAATGATCAG CCGCTTGAAG CACAAACGCT GGAACATTGG TTGCTGGGAA TGGATCGCGA ACTTTTTCAA GCGCTCTATG CCATGAGCCA CGAAAGCCTT GCGCTTGGTG GTCAAGGCAT TCTTGATGAA GAGGGTGAAA TTGGCAAAGC GCTTTTTGCG GCAGGCGCAG GTTTAGCTTC GCTACGCCCC ATGCTTGCGC ACTTGCAAAG CGAAGCCGAT GAGCTTTTTC GCCCGCAAGG CGCCCGCCAG CAACTTAACG AAGCGCTTGC ACGCCACCGC ACCTTGCAAC AGCAATTACG TGAAGCCACA CTTTCAGGCT CCGTATGGCA GGAGAAAAAA GAGGCGTTGG AACAAGCCGA AGCAAAACGC AACGCCTTGC AAGTTCGCAA GCAAGAGCTT GAAACCGAAA AGCATCGCCT TGAACGCTTG CAGCACGCGC TGCCCGAATT AGCCGATCGC AAACATGTAG TGGAGCAACG CGCGGCACTT GGCAAGGTGC CGCTACTCCC TGCTGATTTT GCAGCGCAAC GCGAAGCCCT CCAAAAGCAA CTGCATCTTG CTCAGCATAA TTACGAACGT GAACAAGAGC GTATTACGGC ACTCCAGCAA AGCATTTCGA GCCATCACGT CAACCATGCG CTTTTAGAGC AAGCTGCCGT GTTGGACGAG CTGCATCAAC GGCTTGGCGA ATATCGCAAA GGCAAAAACG ACCTTCCCCA ACGCCAAAGC CAGCGTGCTG CCGCATTGCA AGCGGCAATG GATATTCTTC GCCCACTGTG GGCTGATCTT GCTGGTAGTG AAGAGGCTAT GGATGCAACG GACGATTCGC CAACATTAAT GCAACGCTTA CAAAAAGCAC TTCTCAAGAA AAAAGAGGTG CAGCGCCTTG CCACCCATTT TGAAGCCCTT GTAAGCTCAG GCAAAAGCGC CCGCCAGCAA GTGCAAGAGA GCGAACAAGC TTTGGAGCAG TTGCAGCGTG ACCTTGCTGC CCTCCCCATG CAGGGTGATA GTAATCAGCT TGAACAAACC TTGCGCATAG CCGAGCGCAA CGCAGCGCTT GATCGCGATA TTGCCGAGCT TGAGCAAAGC CTCCGCCATA GTGAGCAGGA GTGCCACGCC ATGCTACAAC GCTTGACGCT ATGGCATGGC ACTTTGGAGC AAGTGCCAAC ACTTCCTCTC CCACTTCCAG AAACTATCAG CCGCTTTGAC GAAGCCTTTC AGCGTTTGCA AAGCGATACG CTTGCCCTTC GCGCACAAGC CGAGGGGGTA GAAAAGCGCT TGCAAGAAAT CACCACCGAA TTGGAGCAAC TTGCGGCTGA AAGCCATGTG CCCTCAGTTG AAGAGCTGCA ACACAGCCGA GCCGAGCGCA ATAAGGGGTG GGAATTGCTG AAACGGCAAT GGCTCCAGCA GGAAGATGTA ACGGCAGAAA GTAACGCCTA CAGTGCCGCC CATCCACTCC ATGAAGCCTA TGAAATAATG GTGAATGCGG CTGACCAGCT TGCCGACCAA 
CTTTACCGTG AAGTGGAGCG CGTGCGCCGC CATACCGCTT TAACGGCTGA AGCGAAAAAG CTCCACCATC AACACACACA TCTGCATGAG CGTTTGGCAA CCCTTGCAAC TGAAGAGGCA GCGCTGCACA CCGCGTGGCA AGAACAATGG CGCACAACCG AAATTGAAGC GCTACCACCT CGCGAAATGG TGGCATGGGT AGCCACATTT GAGGCACTCC GCCAGCACGT TCGAGAGCGC GATAAGCTAC TGCTTGAGCG CAATGTACGC CACAAACGCC GTCAAGAAGC GCACGAGCAA CTTCACCAAG CAGTGGAAGC GGTGGCTCCA CCCTTCCCTA TAAAAAATAA TGAGTTAGCC CCGCTACTTC AATACGCCCA GCAGCAGCTT AGCCGTATGC AAGCGGTTGA AAAAAGAGGC GAAAACTTGC TAAATCGCCA GCGCGACATA ACGCACAACT TAGAGAGCTC ACGCCAGCTT TTAAACCGTG CCGAAGAGGA GCATCGTGAG TGGCGCAAAG AGTGGATAGC CGTAACGAGC GCCCTTGAGC TTACGGGACA AGCTCAACCA ATGGAAATAG TTGATAGCGT TGAAGCAATG CAGCAAGCCT TAACAAAGTT AAAAGAGGCA GAGGAGTTTC GTAAACGCAT TGAAGGCATT GAACGCGATA TGCGGCAGTT TGAGCTGGAT GTAGCAACTG CCACCGCCAC ACTTGCGCCT GACGCGCAAG AGAGCGATGG AGCCAAGCGC GTTGCCATGT TGCATGAGCG CTTGGATGAA GCACGCCGCG AACAAACGCT GTTGCAGCGC GAAAAGGATG AAGTAACACA GCACAAAGAA GCCCTCCGCC GCCATGCTGC AACGCTGCAA GAGGGGGAAC TTCAGCTTAC CGCCATGTGC CAGCAAGCCG AGTGCGCCAC GCCAACCGAC TTGCCAATAG CCGAAGCGCG TTCTCAGCAA GCGCAAGAGT TGCACGAAAA ACTCATGGCA GTTGAAACCC GCTTAGTGCG CATTGCAGGC AGCGCCTCAC CCGAAGCTCT TGAAGCACTT GAAACCGAAG CTGCCACCGT TGAACGCGAT GCCCTACCAA GCCATATTGA AACCATTACA ACCGAAATTC ATCAAGAAAT AGAGCCAGAA ATTGATCAAC TCAATGAACT GAGAGGACGA CTTCGCAATG AGCTGAAACA GATGGAAAAA GAAGATGGTA ATGCGGCTGA CCTTGCCGAT GCCGCCCAAC GCGAACTTGC CCGCATTCGC CGCTTAACCA ACCGCTACAT TCGCCTTCGC TTAGCCGAAA CCATGGTACG CAATGCAACC GAGCGCTACC GCAGCAGTAG TGAACGCCCT GTGCTTAGCC TTGCCTCCAC CTACTTTGCA ACACTCACGC TGCAATCATT TGTGGCATTA GATACCGAAA GTGATGACAA CGGGCATATT GCGCTAATGG GCGTTCGCAC CAACGGCAAC CGCATTGGCG TTGAAGCCAT GAGCAGCGGC ACACGCGACC AGCTCTACTT AGCCCTACGC CTTGCAACCC TGCAATGGCG AATGCAACAA AGCGAACCCA TGCCCATTAT TGCCGACGAC ATTCTTATCA CCTTTGATGA TGCCCGCTCA CGCTCCACCC TTCAAGCCCT TGCAAAACTT GGCGAAAGCT GCCAAATCAT CCTCTTTACC CACCACCGCA CCATTGCCGA TATGGCAAGC CACCGCGCAT TTAAAGGCAC CGTTTTTCTC CATACACTTG GCACCACCAA CGAAAGCGAA CACAACAATG CAGAAACCTC CGCACAACCA CCAAAGCCTG 
AAAATTTAAC CTTGTTTGGA TAA
|
Protein sequence | MKIRRFELNA FGPFSGNVLD FNSPTPGLHI VYGANEAGKS SAMRALYAWF FGYPLRTTDD FLHKKSNLLL SGTLENKQGE VLTFSRRKRK ERDLFDGNDQ PLEAQTLEHW LLGMDRELFQ ALYAMSHESL ALGGQGILDE EGEIGKALFA AGAGLASLRP MLAHLQSEAD ELFRPQGARQ QLNEALARHR TLQQQLREAT LSGSVWQEKK EALEQAEAKR NALQVRKQEL ETEKHRLERL QHALPELADR KHVVEQRAAL GKVPLLPADF AAQREALQKQ LHLAQHNYER EQERITALQQ SISSHHVNHA LLEQAAVLDE LHQRLGEYRK GKNDLPQRQS QRAAALQAAM DILRPLWADL AGSEEAMDAT DDSPTLMQRL QKALLKKKEV QRLATHFEAL VSSGKSARQQ VQESEQALEQ LQRDLAALPM QGDSNQLEQT LRIAERNAAL DRDIAELEQS LRHSEQECHA MLQRLTLWHG TLEQVPTLPL PLPETISRFD EAFQRLQSDT LALRAQAEGV EKRLQEITTE LEQLAAESHV PSVEELQHSR AERNKGWELL KRQWLQQEDV TAESNAYSAA HPLHEAYEIM VNAADQLADQ LYREVERVRR HTALTAEAKK LHHQHTHLHE RLATLATEEA ALHTAWQEQW RTTEIEALPP REMVAWVATF EALRQHVRER DKLLLERNVR HKRRQEAHEQ LHQAVEAVAP PFPIKNNELA PLLQYAQQQL SRMQAVEKRG ENLLNRQRDI THNLESSRQL LNRAEEEHRE WRKEWIAVTS ALELTGQAQP MEIVDSVEAM QQALTKLKEA EEFRKRIEGI ERDMRQFELD VATATATLAP DAQESDGAKR VAMLHERLDE ARREQTLLQR EKDEVTQHKE ALRRHAATLQ EGELQLTAMC QQAECATPTD LPIAEARSQQ AQELHEKLMA VETRLVRIAG SASPEALEAL ETEAATVERD ALPSHIETIT TEIHQEIEPE IDQLNELRGR LRNELKQMEK EDGNAADLAD AAQRELARIR RLTNRYIRLR LAETMVRNAT ERYRSSSERP VLSLASTYFA TLTLQSFVAL DTESDDNGHI ALMGVRTNGN RIGVEAMSSG TRDQLYLALR LATLQWRMQQ SEPMPIIADD ILITFDDARS RSTLQALAKL GESCQIILFT HHRTIADMAS HRAFKGTVFL HTLGTTNESE HNNAETSAQP PKPENLTLFG