Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0823 |
Symbol | |
ID | 4570321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 936297 |
End bp | 937367 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639765421 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_911298 |
Protein GI | 119356654 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000611936 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAATC CGGAACATCT GGCAACGCTG CATAAAGGTG TTGAAACCTG GAACCGATGG AGGCAGGAAG AGCGGGTGCT TCGCCCCGAT CTCAGCGGCG CTGATCTGTG GGGCGCGAAT CTCAGAGGGG TGAATTTCAG CGGTTCGGTT CTTCGCGGAG CAATACTTAC CGATGCTGAT CTGGGTTCGG CGGATCTTCG GGGAGCCGAT TTGAGCGAAG CAAACCTCGG AGGAACTGAT TTGAGTAAAA GTACTATCGA TACAGCTACC CGGTATGAAA CGGTTATCGG TTGCGATGTC GGTGTGAACG GGTTTTATTC ACAGGCAACC GACTCCGCTG CCCTCATGCG TATCGATCCT CCAGGAAACT CCATGCAGGG CGCCAATGTC AACGCTGTTA TTGAAAGTCT GAACGTCGCC AGAAAACTGC ACACGTTTTC CCTGATTCTG GCCGGCATCG CCCTTCTTTT CATTGTCATC AAGCCCAAAT CGATAACCTT GCCCTATCTT GCAGGTTCCT TCAGGTTTGA TGCGCTCAGT TACGCTTTTC TTGCCACCAT TCTTTCGACA GGAATGTTGA TGCTCGTGGC GACCTTTATT GATTCGGCGC TGCAGGGCGT GCGCTATCTG AACGATCGCA GATCAGCCAT GACGGTTGCG CACTTCCCCT GGTTGCTCTC GAAATACGAG CATGATCCCG GTGTAAGCCG CAAATCGAGG GTCATGCGTT TTCTGCTCAG CTATCATCCT GTTGTCTATC TCTATTTTTT TGTAAAATGG GAGTCGGTTT TCACAGGAGA CTGGGAGGCC ATTGCCCGGC ACTATATGGA GCTTCCGGTC ATTCTTGCAG AAGTGCTGCT GCCGTTTTTC TTTGTTATGC TGATGATGCT CTGCAGGAAC ATCTACCGTC TTTCGGAGGG TTTTCAGAAA CCTATTCTTT TTGATACCGC AACTGAACGT GACCGGCGGA GCGACATGGA ACGACTTGCC GAGGCTGTAG AAAAACAGTC ATCAACTATC GGGGTTCTCA TACAGCTTCT CGAAGAGAAA CGGGGAGTGA AGGAGAGATA A
|
Protein sequence | MANPEHLATL HKGVETWNRW RQEERVLRPD LSGADLWGAN LRGVNFSGSV LRGAILTDAD LGSADLRGAD LSEANLGGTD LSKSTIDTAT RYETVIGCDV GVNGFYSQAT DSAALMRIDP PGNSMQGANV NAVIESLNVA RKLHTFSLIL AGIALLFIVI KPKSITLPYL AGSFRFDALS YAFLATILST GMLMLVATFI DSALQGVRYL NDRRSAMTVA HFPWLLSKYE HDPGVSRKSR VMRFLLSYHP VVYLYFFVKW ESVFTGDWEA IARHYMELPV ILAEVLLPFF FVMLMMLCRN IYRLSEGFQK PILFDTATER DRRSDMERLA EAVEKQSSTI GVLIQLLEEK RGVKER
|
| |