Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1715 |
Symbol | |
ID | 4571075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1943652 |
End bp | 1944845 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639766298 |
Product | appr-1-p processing domain-containing protein |
Protein accession | YP_912157 |
Protein GI | 119357513 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00091392 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAAAC ATGGTCAAAC CGTATTTCTA ATAACCATCA ATCCGGTCGG CGTGATGGGT AAAGGGATCG CCCTGCAGTT CAAACATGCT TTTCCCGAGA ATTTCAAAGC TTATGCCGAT GCGGTGAATC GCAAGCAGAT CAGAACAGGC GAGGTTCAGG TTGTCCCGGT TTCATCACTG AACGGCGTAC GGTACATCAT CACCTTTCCC ACCAGGAACC ACTGGCGTTA CCCATCGAAA CCGGAATGGA TAACAGCCGG GTTGCGCGAC TTGCGAAAAA AAATAGAGGA CTACCAGATC GAGTCGATAG CGATTCCTCC TCTCGGATTC GGCAATGGCG GGCTGGACTG GAGCGTGGTA AAAGCTGAAA TCGAAAGCGC TTTACAAGGT CTTCCCGTCG AGATACAGGT CTATGAGCCA TCCTCCGCCA TCAGGGATCT GCTGGTAAAA GAGGACAAGC CTGCAGCTGC ACACCTCACA CCTGTCAGGG CCATGCTCCT GCTTCTGCTC TATCGTTACC GGGCCATGGG AGAACACGCC AGTGAATTCG CAGCCGAAAA ACTCAGCTAT TTCCTGCAGC GGGCTGGAGA AACACAATTG AAACTCGAAT TCACCAAAGG GTATTACGGA CCGTATTCCG GTAAGGTTCG TCACGTGCTA TACGCCCTGA ACGGATATTA CCTGAAGGGT TTCGAACAGA AAGAAGCGAA ACCTTTCGAG CCGTTTGACA TCATTGTCGA GCGGAGTGAC GAAGTACTTG ACTACATCCA GAACAAACTG AATCCGGTCG AGAAAACCCA TCTCGATAAA GTCCTGAAAC TGATCAAGGG ATTTGAATCG CCTTATGGAC TCGAACTGCT TGCTACGGTT GATTATCTCA TTATCGAAAC CGGCAACAGT GACCCGCAGG TTCTGTCCGG CGCAATCAGG CAGTGGTCAG CAAGAAAAGC CGATATGTTT CCTCCTGAAC ATGTGCGCCT TGCGTCAGAA CAACTGCATC TGCTGCGAAA TCAGTCCCTG CCAACTGCTT CGCATTTCGG TATCGGGGAA CGGAATAATC AATTAAAAAA TATAACGATA GCCACTATCA GGTGGCCTGT TCCCGAAATC TGTTTCGGGA ACATCAGTAA GCCACGAACC GCCATAATCT GCCGAAACCG TTTTTTTGTA ACGTCACATC AGAAATACCA ATAG
|
Protein sequence | MKKHGQTVFL ITINPVGVMG KGIALQFKHA FPENFKAYAD AVNRKQIRTG EVQVVPVSSL NGVRYIITFP TRNHWRYPSK PEWITAGLRD LRKKIEDYQI ESIAIPPLGF GNGGLDWSVV KAEIESALQG LPVEIQVYEP SSAIRDLLVK EDKPAAAHLT PVRAMLLLLL YRYRAMGEHA SEFAAEKLSY FLQRAGETQL KLEFTKGYYG PYSGKVRHVL YALNGYYLKG FEQKEAKPFE PFDIIVERSD EVLDYIQNKL NPVEKTHLDK VLKLIKGFES PYGLELLATV DYLIIETGNS DPQVLSGAIR QWSARKADMF PPEHVRLASE QLHLLRNQSL PTASHFGIGE RNNQLKNITI ATIRWPVPEI CFGNISKPRT AIICRNRFFV TSHQKYQ
|
| |