Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1932 |
Symbol | |
ID | 3747307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2463544 |
End bp | 2464914 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637774467 |
Product | CBS |
Protein accession | YP_380223 |
Protein GI | 78189885 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0017665 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATAT TTCTACTTTT TGTTCTCATA CTTGTCAATG GCGCGTTTGC CATGTCGGAA ATTGCGTTAG TAACAGCTAA ACGTTCTCGA CTTTCGCGCC TTGCGGACGA TGGGGATAAA TCTGCCACTA CGGCAATGAA GTTGGGGGAA GATTCCACCA GCTTTCTTTC CACCATTCAA ATTGGCATCA CCTCCATTGG TATTCTCAAC GGTATTGTTG GAGAAGGGGC GTTAGCCGTT CCCTTTTCAC TATTCATTCA TTCGGCAACA GGCATTGAGT TAGAAACCGC TCAGCTTATT GCTACCGTTG TTGTGGTGCT TGGCATTACC TACGTTACCA TTGTGGTGGG TGAATTGGTA CCAAAACGGC TTGGGCAGCT TAATCCCGAA CAAATTGCCT GTTTGGTTGC TCGCCCCATG CAAATACTTG CCACAATTAC TCGTCCTTTT GGGCGCTTGC TCTCCTTTTC AACCAACACG TTGCTTCGTT TAATGGGGGT TAAACCGCAA ATTACCCCAA GCGTTACTGA AGAGGAAATT CATGCTATGC TTGAAGAGGG TTCCGAAGCA GGGGTGATTG AACAGCAAGA GCGCGATATG GTGCGCAATG TGTTTCGCCT TGACGACCGC CAGCTTGGTT CGCTCATGGT ACCTCGTGCC GATATTGTTT TTCTTGATGT AACTCAACCG CTTGAGGAGA ATATTTGCCG TGTAACGGAG TCGGAGCATT CGCGTTTTCC TGTATGCAAC GGTAATCTTC AATCGCTACT TGGCGTGGTG AATGCAAAGC AGTTGTTGCT TAAAACCTTG CGCGGCGGTT TAACCGAATT TGCAACACTC TTGCAGCCAT GTGTGTATGT GCCCGAAACG CTTACGGGCA TGGAATTGCT TGACCACTTT AGAACTTCTG GTACGCAAAT GGTTTTTGTG GTTGATGAGT ATGGTGAAAT TCAAGGCTTA GTTACCTTAC AAGATTTGCT TGAAGCTGTA ACGGGTGAAT TTGTGCCGCG CAACCTTGAA GATTCATGGG CAGTTGAGCG TGCCGATGGT TCGTGGTTGC TTGATGGCTT AATTCCCGTG CCTGAATTAA AAGATACGCT CAAGCTTAAA GAGGTGCCCG ATGAGGATAA GGGGCTTTAC CACACCTTAA GTGGAATGAT TATGTGGTTG CTTGGTAGAA TGCCGCATAC GGGCGATGTG CTTGTTTGGG AAGAGTGGAA TTTGGAAATT GTTGACCTTG ACGGGCAGCG CATTGATAAA GTGCTTGCTT CACCACTCAA CAATGCACCA AAAGCATCTC AAAAAGAGGA AAAGCCGCCC GTCAAATCAG ACGATAATGC GGCTTTGTGT TCCATACGCC CAACGCCGTA A
|
Protein sequence | MEIFLLFVLI LVNGAFAMSE IALVTAKRSR LSRLADDGDK SATTAMKLGE DSTSFLSTIQ IGITSIGILN GIVGEGALAV PFSLFIHSAT GIELETAQLI ATVVVVLGIT YVTIVVGELV PKRLGQLNPE QIACLVARPM QILATITRPF GRLLSFSTNT LLRLMGVKPQ ITPSVTEEEI HAMLEEGSEA GVIEQQERDM VRNVFRLDDR QLGSLMVPRA DIVFLDVTQP LEENICRVTE SEHSRFPVCN GNLQSLLGVV NAKQLLLKTL RGGLTEFATL LQPCVYVPET LTGMELLDHF RTSGTQMVFV VDEYGEIQGL VTLQDLLEAV TGEFVPRNLE DSWAVERADG SWLLDGLIPV PELKDTLKLK EVPDEDKGLY HTLSGMIMWL LGRMPHTGDV LVWEEWNLEI VDLDGQRIDK VLASPLNNAP KASQKEEKPP VKSDDNAALC SIRPTP
|
| |