Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1474 |
Symbol | |
ID | 4570244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1672249 |
End bp | 1673382 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639766060 |
Product | hypothetical protein |
Protein accession | YP_911925 |
Protein GI | 119357281 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCAAC CCTCAATGTT TAATCATATG ATGACTCGCA GACGGTTTAT TATCTCCTCC ATGGCGGCTA TTGGCGGTTT GGCGACCGTA TCGGCATGTT CACCAGAAAA AAATCCCGGC AGCTACATGG ATGTCGCAAG CGGTATCTGG CGTCACGGAA AGGTGGAGGC CGGGAACAGG TCTGCGGTTC TGCGTGAGCT TGTGCGATAT GCGACACTGG CGCCATCAAG CCACAATACC CAGTGCTGGA AGTTTCGCAT TGATGATCGT TCTATCTCGA TTTTCCCCGA TTTCTCGCGC AGATGCCCCG TAGTCGATCC GGACGATCAT CACCTGTTTG TATCAATCGG GTGTGCGATG GAGAACCTTA TACAGGCGGC ATCAGCAAAC GGGCTTGATG GTAATGCGGT TTTCGATCCG TCCTCGCGCG GGAATGTGCG TGTTTCGCTG GAACCAACGA ATGCCGTTGT TACCCCTCTG TTCAAAGCGA TACCGGAGCG CCAGAGCACA CGAGCCGAGT ATGACAGGAA GCCGATTTCC ATGAATGAGC TGGCGATGCT GGAAAGGGAG GGTACAGGCA AAGGTGTCCG GATTATTTTT CTCACTGAAC GCGCGGCAAT GGAGAGCCTG CTCGATTATG TTGTTCAGGG TAATACTGCG CAAATGAACG ACAGCGCATT CGTTGAAGAG CTGAAAGCAT GGATACGCTT CAGTGAGAGC GATGCGGTAC GCAGAGGAGA CGGCCTGTAT TCTGCCTCGT CGGGGAATCC ATCCGTGCCT TCGTGGCTGG GCAGCCTCCT GTTCGGTTTG TTCTTTACAG AGAAGAACGA GAACGACAAG TATGCGAAGC AGGTGCGCAG TTCAGCAGGT ATCGCGGTGT TTGTATCGGA GGGCGAGAAC CCTGAGCAAT GGATAGAAGT CGGGAGATGC TACGAACGGT TTGCGCTTCA GTGCACGGCT TTGGGGATAC GCAATGCCAT GCTCAATCAA CCGGTGGAAG TTGCTGCACT GAGGCCGCAG TTCGGGGCCT TTCTCGGTAT CGGGGAGCAC AGGCCGGATC TGGTCGTACG ATTCGGACGT GGTTCGGGAT TGCCGCAGTC ATTGCGACGT CCGGTTGAAG ATGTTCTGGC ATGA
|
Protein sequence | MPQPSMFNHM MTRRRFIISS MAAIGGLATV SACSPEKNPG SYMDVASGIW RHGKVEAGNR SAVLRELVRY ATLAPSSHNT QCWKFRIDDR SISIFPDFSR RCPVVDPDDH HLFVSIGCAM ENLIQAASAN GLDGNAVFDP SSRGNVRVSL EPTNAVVTPL FKAIPERQST RAEYDRKPIS MNELAMLERE GTGKGVRIIF LTERAAMESL LDYVVQGNTA QMNDSAFVEE LKAWIRFSES DAVRRGDGLY SASSGNPSVP SWLGSLLFGL FFTEKNENDK YAKQVRSSAG IAVFVSEGEN PEQWIEVGRC YERFALQCTA LGIRNAMLNQ PVEVAALRPQ FGAFLGIGEH RPDLVVRFGR GSGLPQSLRR PVEDVLA
|
| |