Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0081 |
Symbol | |
ID | 4570648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 95816 |
End bp | 97117 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639764683 |
Product | hypothetical protein |
Protein accession | YP_910575 |
Protein GI | 119355931 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.725699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA AGGGTTGCCG GACGATCTAC GCGAAGGTAC TCGCCGCAAA TGACAACAGA AAGCAGCAGG TCTATTTCGG CGGTGATTTC CAGGCTATCA ATATTATTCC CTTTGACACG ATAGCTCCTG ATCCCGACAA GCCTCATATT TTCAAGGCAC CTCTCAATTT CTGGTGGCTT TCAGACGACG AAAGCTTGCA CAATGCTTCC CGAGCCCAGC TCATACTCTA TCCCCAATAT CCGGAAGTAA GATTTTCAGG TTTCCTCCAG GGCTGTTCCG CTGCTCCGTC GGAACTCATG GACGAGCGTC TGAGACTGGC AGGGCGGATC CTTTTTCTTG GAATATCTCC CGACGGCAGA ATTATTGGTT ACCTGTGCCA CCCTGAAAGT GAACTTGCGC GGGAGTTTGT TTCTCTCGGA GAACTGCCCC GAAGCGGGGT TTTTCTTGAA CCCGGTCTCG GAACCGGAGT CCTTGATGAT AGATCGTTAC TCATCGAGAA ATTGAGAGTT ATCCATCAGA AAGGCTGGAT CAGATCAAGA AAACTTGGCA GCAACGGTGT GATTTTGCCC TGTGAAGCTC CCAATTGCGG AGGGATGACA CTCGAAGCTG AACTGGATAT TATCCCCAAC AGCCGCTCGG AACCGGACTG GCTTGGCTAC GAGGTAAAGC AGTACAACGT CACCAACTTT CAGAGGATCA ACAGTGGCGT ACTGACTCTC ATGACCCCCG AGCCGACTGG CGGTTACTAC CGGAGCGCCG GTATTGAAGC GTTCATCCGG AAATTCGGCT ATCCCGATAT GACTGGTGCC GCAGATCGTG GCGATCGTCT GAATTTCGGT GGGATTCACA AAGTTGGCGA GTATCACCGT CTGACAAGCC TTCAGATTGT GCTTAAAGGG TTTGATGCCA TCAAGGGAAA AATCACAGAC GCAACGGGTG GAATCTCGCT GATGAACATT GAGGGTGAAG AGGCTGCTGT CTGGGGATAT GCTGAAGTTA TGGCCAAATG GAACCGGAAG CACAACAAGG CAGTCTACAT CCCGAGCCGG TGCGTTCAGT CACCCGAACG CCGTTACTGG TACGGAAACC TGATTCGAAT TGGTACAGGA ACAGATTTTT TGAAGTACCT GCAGGCTATG GCGGAAGGCA AGGTCTACTA CGATCCGGGC ATCAAACTGG AGAATGCCTC AACCACTCCG CGGACCAAAC AGCGAAGCCA GTTCAGAATA AAATCATCCA ATCTCCCGGC ACTCTACCAC AGCATGGATA TTGTTGATTT GAACGAGGAG CAGTCAGAAT AA
|
Protein sequence | MKNKGCRTIY AKVLAANDNR KQQVYFGGDF QAINIIPFDT IAPDPDKPHI FKAPLNFWWL SDDESLHNAS RAQLILYPQY PEVRFSGFLQ GCSAAPSELM DERLRLAGRI LFLGISPDGR IIGYLCHPES ELAREFVSLG ELPRSGVFLE PGLGTGVLDD RSLLIEKLRV IHQKGWIRSR KLGSNGVILP CEAPNCGGMT LEAELDIIPN SRSEPDWLGY EVKQYNVTNF QRINSGVLTL MTPEPTGGYY RSAGIEAFIR KFGYPDMTGA ADRGDRLNFG GIHKVGEYHR LTSLQIVLKG FDAIKGKITD ATGGISLMNI EGEEAAVWGY AEVMAKWNRK HNKAVYIPSR CVQSPERRYW YGNLIRIGTG TDFLKYLQAM AEGKVYYDPG IKLENASTTP RTKQRSQFRI KSSNLPALYH SMDIVDLNEE QSE
|
| |