Gene Information |
Locus tag | Cpha266_0893 |
Symbol | |
ID | 4570507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1017939 |
End bp | 1019012 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 639765488 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_911365 |
Protein GI | 119356721 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
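The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon of the CDS is an untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1017939
end_bp = 1019012

# Assumption: coordinates are 1-based and inclusive on both ends,
# so the span length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1074 357
```

Under these assumptions the result agrees with the listed gene length of 1074 bp and protein length of 357 aa.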
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.417756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGCC CAAAACATCT TGAAATCCTG AAACAGGGCG TTGAAGTATG GAATGAGTGG CGTGATCAGC ATAAAAATGT TATACCTGAT TTTAGAGGTG CCGATCTAAA ATTCATTAAT CTTGCTAACG CCAATCTCTC TATAGCAAAT CTCAGAACAG CTAAATTCTC ATATACAAAT CTTACAAGAG CAAATTTATC AGGTTCTAAT CTTGCTGATG CCAATCTTAC AGGTGCCAAT CTAACGGGAG CAAATCTATC GAGATGCAAT CTTTCTATAG CCAATCTTTC AACGGCCAAT CTTTCAAAAG CTAATCTTGA AGGAGCTATT CTTATAGATG CTGATCTCAC AAGGGCTAAT TTTAGAGAAT CTAATCTCAT ATTTGCAAGT TTATCAGGAA GTGCCCTCAT AAAAACTGAT TTCAGTAACG CAAGTGTCGG ATGGACAATC TTTACTTATC TTGATCTTAG CCCCTTATAC AATTGTGCTA TCGGTCTTGA GACAATAATT CACAAAGGAC CTTCTTCTGT AGGAATCGAT ACGATATACC AATCAAATGG AAATATCCCT GAAGTCTTTC TTCGCGGGTG TGGTCTTTCC GATGAATTTA TTGCCTATAT CCCATCTTTA ACCGGAAAAG GTATCGAATA CTACTCGTGC TTCATCAGCT ACAGCCATAA AGATGAAGTT TTTACTAAAC GGCTGCATAA CGACTTGCAG GCAAACGGTG TGCGCTGCTG GTTTGCTCCG CATGACATGA AAATCGGAGA TAAAATCCGC CCAACCATTG ACGACTCAAT CCGAGTTCAT GACAAGCTGC TTCTGATCCT TTCAGAACAC TCAGTGCAGA GTGATTGGGT TGAGCACGAA GTTGAGCATG CTTTTGATCT TGAGAAAGAA CGAAAACAAA CAGGACTTTT CCCGCTTCGA ATCGATGAAT CAATCATGGA GAGTACAACC GGATGGGCAG GAAATGTGAA GCGCCAGAGG CACATCGGAG ACTTCACAAA ATGGAAACAG CGCGACGCCT ACCAGGCCGC ATTCGACCGC CTCTTGCGTG ATTTGAAAGC CTGA
|
Protein sequence | MASPKHLEIL KQGVEVWNEW RDQHKNVIPD FRGADLKFIN LANANLSIAN LRTAKFSYTN LTRANLSGSN LADANLTGAN LTGANLSRCN LSIANLSTAN LSKANLEGAI LIDADLTRAN FRESNLIFAS LSGSALIKTD FSNASVGWTI FTYLDLSPLY NCAIGLETII HKGPSSVGID TIYQSNGNIP EVFLRGCGLS DEFIAYIPSL TGKGIEYYSC FISYSHKDEV FTKRLHNDLQ ANGVRCWFAP HDMKIGDKIR PTIDDSIRVH DKLLLILSEH SVQSDWVEHE VEHAFDLEKE RKQTGLFPLR IDESIMESTT GWAGNVKRQR HIGDFTKWKQ RDAYQAAFDR LLRDLKA
|
| |
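The gene sequence above begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following minimal sketch shows how such a sequence could be translated and how its GC content could be computed. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers, not part of any database tool.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGCAAGCC CAAAACATCT TGAAATCCTG"
print(translate(first_blocks))  # MASPKHLEIL
```

The ten residues printed match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be checked in the same way.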