Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1610 |
Symbol | |
ID | 6354832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 1741568 |
End bp | 1742794 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642669211 |
Product | pentapeptide repeat protein |
Protein accession | YP_001943633 |
Protein GI | 189347104 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00000228674 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAATA ATATCCGTTC GATTCTCCTT TTCTGCGCTC TTGTGCCGGC ATTGCCTGTC TCTGCCTCGG CCTTCAATAC CGCAGATTTC AATGCGCTGA AAACCGGCGT GAAACCATGG AACAGTTACA GGGCCGGGCT CGGCGGACGT GTCGCGGATC TTTCCGGGGC GCAGCTTAAG GGCATGAACC TGAGAGGGGC GGACTTGAGC TACGCCGATC TTTCCGGTGC CGATCTCGCC AGTTCCGATC TCAGTAAAGC CAGGCTCGAT CATGCCCGAC TCGACTCGGC CGTACTCCGT TCGGCTTTGC TGGTCAGGGC TTCGCTCGAT AAAGCGCGGC TCCATAATGC CGATCTTGAA GACGCAGTGC TTGAAGCCGC TTCGTTCAAA GGAGCCTTTA TGCAGACCGC GGTACTCAAG AAGGCAGACT GCACCGGCGC CGATTTCAGC GGTGCCGATC TCCGGGAAAC GAATTTTCGT GAAGCGAGGC TTGCCGGTGC ACTGCTAACC GGTGCTGATC TGCGAGCGAC CTACCTCTGG CGGGCCGACA TGAGCAGGTC GGTATTGAGT GGTTCCAGGG TTTCGCCGTC CACGGTACTG GCTTCAGGCA GCTATGCTTC GCAGGAGTGG GCTTCGGAGC ATCGCGCGGA GTTCCTCAAC GATGCACCTG AGCCCGTTCA GGTTGTCGCA GGAATGGCTC GTACCGGTCA GCCTCTTCCA ACGGTAACGG GAAACGTCGG AAACTCTGAG TCGGCAAAAA GCGGCTCGAA GGGAAACCTG TGGCGCAACT CCGGTTCCGG CGCAGCTGTC GTTTATGATC AGGTGCTCTA TAAAAAGCTG AAATCCGGGG TATTCGCATG GAACGATATG CGCAAGCGCA ACCGGGCCAT GGAAGTCAAT CTCCGTCAAG CGAAATTCGA TCAGAAAAAT CTCAGTTATG CCGATCTTGC CCATGCCAGG CTGCAGGGAG CAAGTTTCAG GAAGGCCGAT CTTTTCGATG CCGACCTTCG GAACGCCGAT CTTTCGGGAT GCGATATGCG CGAAGCGAAT CTTGAAAAGG CCGATCTGGG AGGAGCCGAT CTTTCCGGTG TGAATCTCTG GCGGGCGAAT CTCGGCCGCG CGCGTCTTAA CGGCGTTAAG GTTTCCGCCT CTACTGTTCT CGATACCGGC AAAAAGGCTG ATCAGAAGTG GGCTGAACGG CATGATGCCG TATTTATTCA TGAGTAA
|
Protein sequence | MQNNIRSILL FCALVPALPV SASAFNTADF NALKTGVKPW NSYRAGLGGR VADLSGAQLK GMNLRGADLS YADLSGADLA SSDLSKARLD HARLDSAVLR SALLVRASLD KARLHNADLE DAVLEAASFK GAFMQTAVLK KADCTGADFS GADLRETNFR EARLAGALLT GADLRATYLW RADMSRSVLS GSRVSPSTVL ASGSYASQEW ASEHRAEFLN DAPEPVQVVA GMARTGQPLP TVTGNVGNSE SAKSGSKGNL WRNSGSGAAV VYDQVLYKKL KSGVFAWNDM RKRNRAMEVN LRQAKFDQKN LSYADLAHAR LQGASFRKAD LFDADLRNAD LSGCDMREAN LEKADLGGAD LSGVNLWRAN LGRARLNGVK VSASTVLDTG KKADQKWAER HDAVFIHE
|
| |