Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1938 |
Symbol | |
ID | 3746698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2469736 |
End bp | 2471013 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637774473 |
Product | Sel1 repeat-containing protein |
Protein accession | YP_380229 |
Protein GI | 78189891 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGTTA TGAAAAAATT TATCACAAGC ATTGTTATAG CAAGTTTAAC GCTGTTGGCG ATTAATGGAT TTTGTGAAAC ACCCTCACAA AAACAAATAT CTCAATGGCA ACAAGCTGCT GCGCAAGGTA ACTCAGAAGC ACAATTAAAT CTTGGTTATG CCTATGATCA TGGAGAAGGA GTGAAGCAAG ATTATGCGGA GGCGATAAAA TGGTATCGGT TGTCGGCAGC TCAAGGTGAT GTTAAAGCAC AATTTAATCT TGGGGTGATG TATTATAACG GTGAAGGAGT AAAGCAAGAT TATGCAGAGG CGATAAAATG GTTTCGTTTA TTAGCAACTC AAGGTGATGC AATAGCACAA TTTAATCTTG GGGTGATGTA TTATAACGGT GAAGGCGTGA AGCAAGATTA TACAGATGCG TTGAAATGGT TTCAGTTATC AGCAGCTCAA GGAAATGCAA TGGCACAAAA CAATCTTGGT GTGATGTATG CTAAAGGTGA AGGCGTGCAG CAAGATTATG CAGAAGCGTT GAAATGGCAT CGTTTATCAG CAGCACAAGG CAATGCAATG GCACAAAACA ATCTTGGAGC GATGTATTAT AAGGGTGAAG GAGTCGAGCA AGATTATGTG GAGGCACTAA AATGGTATCG GTTATCGGCA GCACAAGGAG ATGCGGTTGC GCAATGGATT CTCGGTTTGA TGTACTATGA AGGTCAAGGA GTAAGGCAAG ATTACGGAGA AGCGATAAAA TGGTATCGTT TATCAGCGGC TCAAGAAGAT GCGAAAGCGC AATATAACCT TGGCTTGATG TACTACAATG GTGAAGGTGT GAAGCAAGAT TATGCCGAAG CGTTGAAATG GCATCGTTTA TCAGCAGCAC AAGGCAATGC AATGGCACAA AACAATCTTG GAGCGATGTA TGCTAAAGGT GAGGGCGTGC AGCAAGATTA TGCAGAAGCG TTGAAATGGC ATCGTTTATC AGCAGCTCAA GGTGATGCCA CAGCACAAGG TATTCTCGGT TTGATGTACT GTGAAGGTTA TGGAGTAAGG CAAAATTACG GAGAAGCGCT AAAATGGTAT CGTTTATCGG CAGCTCAAGG AAATGCAGGT GCACAATACA ATCTTGGTCT GATGTATTAT AACGGTACAG GTGTTAGGCA GAGTAAAGCA ATTGCAAAAG AGTGGTTTGG CAAAGCTTGT GATAATGGTT TCCAAGATGG ATGTGATGCA TATCGGGAGT TAAATGAAGC TGGGGCAAAA ACTAATAGGA GCCGGTAA
|
Protein sequence | MNVMKKFITS IVIASLTLLA INGFCETPSQ KQISQWQQAA AQGNSEAQLN LGYAYDHGEG VKQDYAEAIK WYRLSAAQGD VKAQFNLGVM YYNGEGVKQD YAEAIKWFRL LATQGDAIAQ FNLGVMYYNG EGVKQDYTDA LKWFQLSAAQ GNAMAQNNLG VMYAKGEGVQ QDYAEALKWH RLSAAQGNAM AQNNLGAMYY KGEGVEQDYV EALKWYRLSA AQGDAVAQWI LGLMYYEGQG VRQDYGEAIK WYRLSAAQED AKAQYNLGLM YYNGEGVKQD YAEALKWHRL SAAQGNAMAQ NNLGAMYAKG EGVQQDYAEA LKWHRLSAAQ GDATAQGILG LMYCEGYGVR QNYGEALKWY RLSAAQGNAG AQYNLGLMYY NGTGVRQSKA IAKEWFGKAC DNGFQDGCDA YRELNEAGAK TNRSR
|
| |