Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0101 |
Symbol | |
ID | 3747589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 115674 |
End bp | 116975 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637772627 |
Product | hypothetical protein |
Protein accession | YP_378422 |
Protein GI | 78188084 |
COG category | [R] General function prediction only |
COG ID | [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.292641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACGG CATTTGTAAA AATATGGGGC GAATTAGTGG GAGCGGTAGC TTGGGATGAT GCTACCGGCT ATGCCACGTT TGAATATGAT GCCAAATTCA AGAGTAAAGG CTGGGAATTG GCTCCCTTAC AAATACCGGT AAATGCAACC AAAAGCAACT TTAGTTTTCC TGCCCTGCGT AAAAAGGCGG ATCCTGCTTT AGATACCTTC AAAGGCTTAC CCGGCTTATT AGCCGATATG CTGCCCGATC GTTACGGAAA TGAGCTCATC AACTTGTGGT TGGCTCAAAA GGGTCGTCCG TTAGACAGCA TGAATCCTGT AGAAACTTTG TGCTTCATAG GCACTCGGGG AATGGGTGCC TTGGAGTTTG AACCCACCAC CTTAAAGGAA AGCAAAAAAG CCTTTTCGCT GGAAATTGAT AGCTTGGTTG AGATAACTCA AAAAATGCTC ACCAAAAAAG AAGCATTCGT AACCAACCTG CAGGAAAACG AAGAAAAAGC CATTCTTGAA ATACTACGCA TTGGAACATC TGCCGGTGGT GCTCGGCCTA AGGCAGTGAT TGCTTACAAC GAAAGAACAG GTGAAGTACG ATCTGGTCAA ACCAATGCGC CACAGGGGTT TGAGCATTGG CTGCTAAAGT TGGATGGGGT GAGTGAGGTG CAGTTGGGCG CAAGTCATGG GTATGGCCGG GTGGAAATGG CGTACTACAA CATGGCTGTA GCTTGTGGCA TTCAGATAAT GCCTTCCAGA TTATTGGAAG AAAACGGCAG GGCACATTTT ATGACCAAGC GTTTTGACCG TGAAGGCGGT GCAGCCAAAC ACCATATTCA AACCTTTTGT GCCATGAAGC ACTTTGATTA CAATCTTGTA ACTAATTTTA GTTACGAGCA GTTGTTTCAA ACGATGCGGG AACTAAAGCT ATCCTATCCG GATGCTGAGC AGTTGTTTCG CAGGATGGTA TTCAATGTAG TAGCCCGTAA CTGCGATGAC CATACAAAGA ACTTCGCCTT CCGGTTAAAA AAGGATGGAA AATGGGAACT GGCTCCGGCC TATGATGTTT GCCATGCCTA TCAACCCAAA CATCAATGGG TAAGTCAACA TGCTTTAAGC ATCAATGGCA AACGAACTAA TATTACTAAA GACGATTTGC TCACCATTGG CAAATCCATC AAAAATAAAA AGGCTGCAGA AACCATTGAG GAAATCAGTA ACACAATAAG CCAATGGAAA ACCTTTGCCG ATGAAGTAAA GGTGTTACCC AAACTGCGTG ATGAAATAGC CGCTACATTG ATTCGATTAT AA
|
Protein sequence | MKTAFVKIWG ELVGAVAWDD ATGYATFEYD AKFKSKGWEL APLQIPVNAT KSNFSFPALR KKADPALDTF KGLPGLLADM LPDRYGNELI NLWLAQKGRP LDSMNPVETL CFIGTRGMGA LEFEPTTLKE SKKAFSLEID SLVEITQKML TKKEAFVTNL QENEEKAILE ILRIGTSAGG ARPKAVIAYN ERTGEVRSGQ TNAPQGFEHW LLKLDGVSEV QLGASHGYGR VEMAYYNMAV ACGIQIMPSR LLEENGRAHF MTKRFDREGG AAKHHIQTFC AMKHFDYNLV TNFSYEQLFQ TMRELKLSYP DAEQLFRRMV FNVVARNCDD HTKNFAFRLK KDGKWELAPA YDVCHAYQPK HQWVSQHALS INGKRTNITK DDLLTIGKSI KNKKAAETIE EISNTISQWK TFADEVKVLP KLRDEIAATL IRL
|
| |