Gene Information |
Locus tag | Cphamn1_1241 |
Symbol | |
ID | 6374918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1346825 |
End bp | 1348063 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642683739 |
Product | pentapeptide repeat protein |
Protein accession | YP_001959654 |
Protein GI | 189500184 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0335657 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGCT TCTTGAGAGC GGCCTGTCTG ATCGTTCTGG CGGGAGTTTT TCTTCCGTCA ATCAGCGCTG CCTATGACAG CGGAAGTCTG ACGCTCATCC GTAAGAGTGT CACGTCATGG AACAGCATGA GAGAGAACTA TCCTGAAGCA GCGATCGATC TCAGCGGGGC GGACCTGAAA GGCCGGAATC TTAAAGGCGC TGATCTGCAC AACGCCAATC TTCAGGGTGC GAATCTTCAC GGTGCCGATT TGAGCGATAC CGATCTTCGT GGGGCATCTT TTGACCATGC GTCACTGAAG GGCGCGCTGC TTTTCGATGC CGATCTTCGT GAAGCCACTG TACGCGAAGC CGATCTTGAG GATGCCGCTT TCGAAGGCGC CGATCTCAGA GGTGCCGTGC TTGACGGCGC GGTGATGAAA CAGGCGGATC TTGGTGAATC CAATCTTCGA AACGCCAGTC TGAGAGGAAC TGATCTGCGG GCGGCAAACC TGAAAATGGC GGATCTGGCC GGTTGTGATC TGAGTGGAGC ATACCTGTGG AGGGCAGTAC TTGACGGGGC AAATCTTGAG AACAGTGTCG TGACATCGGT CACTATCGTT GAAACCGGTC GTTCCGCCGA TCCGGAATGG GCTCAGAAGA ACGGAGCAGT GCTTGCCATG TCCGAACCAG CCCGGCAAAA GGAGGGTGCT GCTGAAGCGG AAAGCGAGAA TACAGTGACA GAGTCGATTC TTGCTCAAAA AACCTGGCCG ATAAATCCTG TGGTGCAGAA AATCCGTTTC GGCGTGGAGA GAAAAGATGC TGCAACGCTA TCATACGACG TTCATCAGCG GGAGTTGTTG ATAAAAAGCG TCTCAAAATG GAACAGGATG AGGGAGACGA ACCCTGATGC TCCGGTTCGT CTTTCCGGAG CAAAATTAAG CAGGAAAGTG CTCGATGGAG CGGATTTGCG GGATGCCGAT CTTGCAGGAT CCCTGATGAA AAGAACAGGG TTGGCCGATA CTGATCTGAG GAATGCCGAT CTCAGGGAGG CAAATCTCCG TGAAGCGGAA CTGACAAATG CCGATCTTCG GGGGGCGGAT TTGAGGGGAG CCTACCTGTG GAGAGCGAAT CTGAGCTGGA CGAAAATCGC AGGGATACGT GTCAATTCGC ATACTGTATT CGATGATGGA AAGAATGTTA CGCCTGCATG GGCGAAAAAA AGAGGCGCAG TGTTCATGGA CCGGGACATG GAAGAGTAG
|
Protein sequence | MNSFLRAACL IVLAGVFLPS ISAAYDSGSL TLIRKSVTSW NSMRENYPEA AIDLSGADLK GRNLKGADLH NANLQGANLH GADLSDTDLR GASFDHASLK GALLFDADLR EATVREADLE DAAFEGADLR GAVLDGAVMK QADLGESNLR NASLRGTDLR AANLKMADLA GCDLSGAYLW RAVLDGANLE NSVVTSVTIV ETGRSADPEW AQKNGAVLAM SEPARQKEGA AEAESENTVT ESILAQKTWP INPVVQKIRF GVERKDAATL SYDVHQRELL IKSVSKWNRM RETNPDAPVR LSGAKLSRKV LDGADLRDAD LAGSLMKRTG LADTDLRNAD LREANLREAE LTNADLRGAD LRGAYLWRAN LSWTKIAGIR VNSHTVFDDG KNVTPAWAKK RGAVFMDRDM EE
|
| |
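Note: the coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length includes the terminal stop codon (the listed gene sequence ends in TAG). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


# Values copied from the Gene Information table for Cphamn1_1241.
length = gene_length_bp(1346825, 1348063)
print(length, protein_length_aa(length))  # prints "1239 412"
```

Under these assumptions the computed values agree with the reported Gene Length (1239 bp) and Protein Length (412 aa).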