Gene Information |
Locus tag | Cpha266_1047 |
Symbol | |
ID | 4571009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1185859 |
End bp | 1187145 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639765650 |
Product | hypothetical protein |
Protein accession | YP_911518 |
Protein GI | 119356874 |
COG category | |
COG ID | |
TIGRFAM ID | |
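The coordinates and lengths in this record agree with each other, which a short sketch can check. It assumes that start and end are 1-based, inclusive positions and that the CDS ends in a single stop codon; the record states neither, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1185859, 1187145)
print(length)                  # 1287
print(protein_length(length))  # 428
```

Both values match the Gene Length and Protein Length fields above.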
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00366943 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATGA TGCCGCATCG CATTGAAAAG AATTTGGGCG GGGTAGCAGA GCTCTTTTTG CGAGTTTCAG TGACGGTCAT CACCATTGCG GTTTTTCTTG GAGCAATCGG CGCCTTGCTC GGTAATATTT TTTTGCTGCG TCTGCATCCC TATGTTTTTT TTATAGGGTT CGGCAACCTT GCCATTCTTA TTCTCAACAG GTATCTCACG GCTGTCATCT ACCCGGAGTT GAGAATAGAT CCTCATAAGC AGCTTCGCTA TATGTATGCC GTGCTGCTAT CACTGATAAG TATTGCCATT GCACTGTTTA TGGAGTGGCC TCTTCTGAAG GCCGCCACAG GCCTTTTGCT CATGGTTGTT GTTATGGGGC CGCTCAAGGA GATATTTACA ACTCTTTCCG TCAGCCGGAT ATGGAAGGAG GTTTCTGTAC GTTATTATAT TTTCGATGTT CTTTTTCTGC TTAATGCCAA CCTCGGACTT TTCACCCTTG GCCTGAAAGA GGCTTTTCCC GACCAGAAGA TTATTCCTTT CTTTGTTACG CAGTCAGCCT ATTTTCTCGG TTCCTCTTTT CCTCTCAGTA TCAGCGTCAT GGGATTTCTC TATACTTACG GCTGGCGTAC CTCTCCGAAA AGAGGGCTTA TCCGGCAGCT TTTCAGTATC TGGTTCTATG TTTTCGTCGG TGGTGTTCTC GGCTTTCTCA TTGTTATTCT CCTTGGTAAT TATTTGGGCA TGATGCTGAT CAGCCACCTT CTTCTTTTAG GCGTCATGGC TATACTCGGA GGTTTTGCCG CCTATCTCTA TGGCTTTTTC AAAAAGAACT TTCATCATCC GGCGCTTGCC TTTCTGTTAA GCGGGCTCTC CCTTTTATTA GCAACAAGCG CCTATGGCAT CATGAATGTT TACTTCATCA AGGGGATCCC TTTCGGATCC TATCCCCCTA TTCGCCTGGA TAAAATGTGG CTTTACCACT CTCATACCCA TGCGGCACTG CTCGGGTGGA TAACCTTCTC TTTTATTGGC ATGATCTATA TCGTCATACC TGCAATTTTC CGTTCGAACT CTCTTCAGTT TCTTCAAGGT TCCGGAGAGC TTTCTGAAAT GCTGCAGAAG AAGACGATGA AAAAGGCATT CAGGCAGCTT ACCATTATGC TCCTGTCGGC AACAGCAATC CTTCTTGCTT TTTTTCTTGA AAACCAGATA CTTCTTGGTC TCTCGGGTCT TCTGTTTGGT TGTTCGGTAT TTTTTGTAAT TATCAATCTT CGTTCTGAAC TCTACGAGGA AGAATAA
|
Protein sequence | MHMMPHRIEK NLGGVAELFL RVSVTVITIA VFLGAIGALL GNIFLLRLHP YVFFIGFGNL AILILNRYLT AVIYPELRID PHKQLRYMYA VLLSLISIAI ALFMEWPLLK AATGLLLMVV VMGPLKEIFT TLSVSRIWKE VSVRYYIFDV LFLLNANLGL FTLGLKEAFP DQKIIPFFVT QSAYFLGSSF PLSISVMGFL YTYGWRTSPK RGLIRQLFSI WFYVFVGGVL GFLIVILLGN YLGMMLISHL LLLGVMAILG GFAAYLYGFF KKNFHHPALA FLLSGLSLLL ATSAYGIMNV YFIKGIPFGS YPPIRLDKMW LYHSHTHAAL LGWITFSFIG MIYIVIPAIF RSNSLQFLQG SGELSEMLQK KTMKKAFRQL TIMLLSATAI LLAFFLENQI LLGLSGLLFG CSVFFVIINL RSELYEEE
|
| |
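The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it does not handle alternative start codons such as GTG or TTG, which is harmless here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCACATGA TGCCGCATCG CATTGAAAAG"))  # MHMMPHRIEK
```

The output matches the first ten residues of the protein sequence listed in the record.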