Gene Information |
Locus tag | Cag_1728 |
Symbol | |
ID | 3746511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2244347 |
End bp | 2245516 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637774265 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_380022 |
Protein GI | 78189684 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.952308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCTAA CGTTTCCGCC AGCCGTTTCT TTACAATTCT TTAATATGAA CAATCAAAGC ATGATCTCCT CTTTTTTTCA ACGCATGGCG TTTACGGCAC TTGTGGCAAG CGCACTACCA TCATCTCTTT TTGCTTACGA CCGTGCTCAT GTAACCTTGT TGCAACAAGG TGTTGCGGTG TGGAATAACC AGCGTCAAGC AACCATGGGG CAAACGCTTG ATTTATCGCG AGCACCGCTT GCTAAGGCAC AACTTGGCGA GGCGAATTTA GCGCATGTGT CGTTAAGCAG TGCTTTTTTA CAAGCAGCAA ACTTGCGTGG GGCTAATTTG CAAGCGGCTA ATTTGCGTTG GAGTGTGCTT GATGGTGCCG ATTTGCGCGA TGCGGTGCTG GTGGGTGCTC ATTTATTTGA AGCAAGCTTG GTGAAAGCAG ATGCTCGTGG CGCTAATTTT AAGAGTGCAA CATCGTTGGA GCAAGCTGAT CTTTCAGGCG CACTTGTTTC CAATAACACC ATTGTGCCAT CAGGTGAACG CGCTCACGGG CAGTGGGCGT TACGCCATCA TGCTACGTTT GTGCAGGAGC CTGAGCGCCC GATTGCCTCT ATTGCTTCAG CCATCTCATT TTCTCCCGAA CGTACCATCA CCTCACCGCC TAACTCTGCG CCAACAACGG TATCTCAATC TGCGGTAACT GCTCACCCCT CAAACGTTGT GCCATCACCA CAAGCATCAG CGCAAGCGCC AATAACGAAA GAGTATGCCC GTGCTACGCT GAACGGGGTC AATTGGAGCA ATGCGGATTT AGCAGGTGCT AATTTTTATA AGGCTGATAT GAAAGGTGCC CAGCTACAAG GTGCAAATTT GCAAGGCGCT CATTGTGATC GGGCGTTTTT GCTTCAAGCC AATTTACAAG GTGCTAACCT TACGAAAGCA CTGTTGTTTG GCGCTACACT CGACAAGGCT GATTTAAGAA ATGCTAATTT AACGGAAGCG TCGCTTTTTG GGGCAAATTG TGAGGGAGCT GATTTGCGTG GAGCTATTTT AACGAGGGCA AACGTAACGG ATGCGGTGTT GACTAATGCG CTTATTTCTT CCACCACCGT GCTGCCATCA GGTAAAGCGG CAACACGGCA GTGGGCGCTC ATGCAGCAAG CTATCTTTAG TCAGGATTGA
|
Protein sequence | MRLTFPPAVS LQFFNMNNQS MISSFFQRMA FTALVASALP SSLFAYDRAH VTLLQQGVAV WNNQRQATMG QTLDLSRAPL AKAQLGEANL AHVSLSSAFL QAANLRGANL QAANLRWSVL DGADLRDAVL VGAHLFEASL VKADARGANF KSATSLEQAD LSGALVSNNT IVPSGERAHG QWALRHHATF VQEPERPIAS IASAISFSPE RTITSPPNSA PTTVSQSAVT AHPSNVVPSP QASAQAPITK EYARATLNGV NWSNADLAGA NFYKADMKGA QLQGANLQGA HCDRAFLLQA NLQGANLTKA LLFGATLDKA DLRNANLTEA SLFGANCEGA DLRGAILTRA NVTDAVLTNA LISSTTVLPS GKAATRQWAL MQQAIFSQD
|
| |
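The coordinate and length fields in the record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon (both consistent with the values listed, but not stated by the record itself). The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 2244347
END_BP = 2245516
GENE_LENGTH_BP = 1170
PROTEIN_LENGTH_AA = 389


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Remove the whitespace that splits a displayed sequence into blocks of ten."""
    return "".join(text.split())


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

To work with the sequences themselves, pass the full text of the "Gene sequence" or "Protein sequence" field through `clean_sequence` first, since the record displays them in space-separated blocks of ten characters.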