Gene Information |
Locus tag | Cag_1809 |
Symbol | |
ID | 3746924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 2330964 |
End bp | 2332094 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637774347 |
Product | hypothetical protein |
Protein accession | YP_380103 |
Protein GI | 78189765 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
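The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon is not translated


# Values copied from the record for Cag_1809
length = gene_length(2330964, 2332094)
print(length, "bp")                    # 1131 bp
print(protein_length(length), "aa")    # 376 aa
```

Both results match the reported Gene Length (1131 bp) and Protein Length (376 aa).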
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.895599 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCAA TCGCCCCCAT AATTCCATTC TTGCTGCTCT TTTTGTGGCT TTTGCAAGGG TGCGCTTCCG ATAGAGCACC ATCAGGCGGT AGTGCCGATA CCACGCCTTT GCGCTTATTA GCATCAACTC CCATAAATGG CACACAAAAT TTTAAAGGCA ACCAGCTTCA GCTTTACTTT AGCCACGAAG TGAGCAGTCG TGCACTGTTA CGCGCACTTC GCACATTCCC TGATATAGGA CAATTTGAAC TGACGGTAAA CGGCAAACGC GCCGATATTC AGTTACTGGA TACTTTACAA GCCAACCAAA CCTACACCTT GCTGCTGAAT CGCCACTTGA ACGACTTTCG TGGGCAGTTG CTCCACGCGC CAACCACTCT TGCGTTTTCA ACAGGCAACA ATGTGAATAA TGGCACCATA CGTGGCACAG TAGTACAGTA TAACGGCACA CCAGCCAGCA ACGCTTTACT CCTTGCCTTT GCAAGCGCCG AAAAAGGGGC AACGGTTAAT TTGTTGGAAA ACAAGCCAAC ACAGATAGCA CAATGCGATG CTTCGGGCAG CTTTGCGTTT AACCATTTGC CGCATGGCAG CTACCACGTA GTAGCCATCA ACGACCGCAA CCACGACCTT GCATGGGCAC CAAGCAGCGA AGAGTACGCC ACTCCAAGCC AGCCACTTAT GGCAACAAAC AGTGCAAACC AACTATTGCG CCTTTCGCCT CCACTTAAAA GCCCAAAGCC GCTCAAAATC CCTTTGGAAG CCTCTTCAGC CCCAACAAAT TCAACGATTG CAACAGGTAG CCTGAGCGGC ATGTGTACGG TACGTGGCAA TCCACCAAGC GTAATTATTG AAGCTATCTC GCCATCAGCC ACTTACTACA CCGTTGCGGT GCGCAAAAAA GCAGGCAGCT ACACCTACCA TTTTAACCAA TTACCCGTTG GAGACTACAC GATTACCGCT TCCATTCCAA CCGCAAGCTA TCAGCCAAAC CAAGCATGGC AATGGAATGC AGGTTCAGTA GCACCTTTTG TGCCCTCTGA TAGTTTTACC TTTTATCCTG AAACCGTTAC CATTCGAGAA GAGTGGCTTA CCGAACGCAT TAACATTACC TTTCCTACCA TTTTACAGTA A
|
Protein sequence | MPSIAPIIPF LLLFLWLLQG CASDRAPSGG SADTTPLRLL ASTPINGTQN FKGNQLQLYF SHEVSSRALL RALRTFPDIG QFELTVNGKR ADIQLLDTLQ ANQTYTLLLN RHLNDFRGQL LHAPTTLAFS TGNNVNNGTI RGTVVQYNGT PASNALLLAF ASAEKGATVN LLENKPTQIA QCDASGSFAF NHLPHGSYHV VAINDRNHDL AWAPSSEEYA TPSQPLMATN SANQLLRLSP PLKSPKPLKI PLEASSAPTN STIATGSLSG MCTVRGNPPS VIIEAISPSA TYYTVAVRKK AGSYTYHFNQ LPVGDYTITA SIPTASYQPN QAWQWNAGSV APFVPSDSFT FYPETVTIRE EWLTERINIT FPTILQ
|
| |
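The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a sketch only, assuming the amino acid assignments of NCBI translation table 11, which are identical to those of the standard code (the tables differ in the set of permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, in the same 10-base grouping as the record
print(translate("ATGCCATCAA TCGCCCCCAT AATTCCATTC"))  # MPSIAPIIPF
```

The output corresponds to the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the complete protein.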