Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_0537 |
Symbol | |
ID | 6354888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 604962 |
End bp | 605978 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642668173 |
Product | pentapeptide repeat protein |
Protein accession | YP_001942608 |
Protein GI | 189346079 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGATC AGGAACATCT GACCGTTCTT CGGCAAGGAG TTGCATCATG GAACAGGTGG CGCCTTGAAA ACAGCGGTAT TCAGCCCGAT CTGAGCGGTG CCGACCTGCG GGGCCGCGAA CTTCAGGATG CGGATTTCAG CGGTACCGAC CTTCGCGCCG CCGATCTCAC CGGAGCCGAT CTTCGTGGTG CAAGGCTCAG CAAGAGCACT ATCGATATTC ATACCCGTTA CGATACTATC CGGGGTTGTG ATATCGGAGT GAACGGATTC TATTCTCCGG CTACCGATTC CGCAGCCCTC ATGCGTCTCG ATCCTCCGGG AAACTCCATG CAGGGGTCCA ATGCGGAGGC TGTGATCGAA AGTCTCAAGC ATGCCAGAAA ACTGCATACC TTTTCCATGA TTCTGGCCGG TATCGGTCTT TTGTTTATCG TCATCAGGCC TAAATCCATT TCCCTTCCAT ACCTTGCCGG ATCGTTCAAG TTCGACGATC TCAGCTACGC TTTTCTTGCT GCGCTGCTCT CCACCTCTCT GCTCAGTCTT GTCGCGACCT TTATCGATTC CGCACTGCAG GGGGCGCACT ATCTCAACGA CCGCCGTTCA GCCATGACGG TAGGTCACTT TCCCTGGTTG CTTTCCAAAT ATGAACAGGA GGGGGCATTC AGACGCCAGT CTAAAGTCAT GCGTTTTTTT CTCAGTTTTC ATCCGCTGGT TTACCTGTAC TTTTTTGTCA AATGGGATGC CCTTTTTCTT GGCGACTGGT ACGGAGTGAT AAGGCACTAT CAGGAACTTC CGGTTATTCT CGGGGAGTGG CTTCTTCCGG TTTTTCTGGT CATTCTTGTA CGGCTCTGTA TGAAAATTTT CAGACTCTCG GAAGGATTTC AAAAACCTAT TCTTTTCGAT ACGGTAACGG AGAGAGAACG GCGTACCGAT ATGGAGAGGC TTGCCCAGGC AGTCGAAAAA CAGGCTGTGG AAATCTCGGC ATTGACCGCA CTGCTGCGCC GGGAAAAAGA ACGGTAG
|
Protein sequence | MADQEHLTVL RQGVASWNRW RLENSGIQPD LSGADLRGRE LQDADFSGTD LRAADLTGAD LRGARLSKST IDIHTRYDTI RGCDIGVNGF YSPATDSAAL MRLDPPGNSM QGSNAEAVIE SLKHARKLHT FSMILAGIGL LFIVIRPKSI SLPYLAGSFK FDDLSYAFLA ALLSTSLLSL VATFIDSALQ GAHYLNDRRS AMTVGHFPWL LSKYEQEGAF RRQSKVMRFF LSFHPLVYLY FFVKWDALFL GDWYGVIRHY QELPVILGEW LLPVFLVILV RLCMKIFRLS EGFQKPILFD TVTERERRTD MERLAQAVEK QAVEISALTA LLRREKER
|
| |