Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1743 |
Symbol | |
ID | 4571105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1969672 |
End bp | 1970742 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639766326 |
Product | hypothetical protein |
Protein accession | YP_912184 |
Protein GI | 119357540 |
COG category | [S] Function unknown |
COG ID | [COG2307] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.331715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAGCC GAGTCGCCGA ATCACTTTTC TGGATGAGCC GTTACTTTGA ACGGGCGGAG AATACCGCAA GGTTTCTCGA TGTCAATTTC AACCTGCTGC TCGATCTCAA CCAGATAACT GCGGTTGAAA ACCCCAATTA CTGGGTAGCG CTCATTCTCG TCACCAGTGA CAGGGAGAAG TTCAATGAGC TCTACTCCGA ATACAATGCC CATACGGTAA CCGATTATCT TGTATTCAAT AAAAGCAATC CGAACTCCAT CAGTTCGTGC GTGAGCCTTG CAAGGGAAAA TGCCAGAAGC ATCATTGAAA CCATTTCGAG TGAAATGTGG GAACAGGTCA ACAACCTCTA CCACTTTCTG CAGAACTCCA CCCCGATGTT TGTGCATAAC GACCCTTACA GCTTTTATAA AGAGATAAAA AACGCATCCC ATCTTTTTCA GGGTATCACC GACAACACCT TTTCACGCAA TGAGGGATGG GACTTCGTGC AGATAGGCAA GTACCTTGAG CGGGCCGACA ACATTGCCAG GCTGATTGAT GTGAAATATC ATATGCTGCT CCCGGAATAT CAGGATTCGA GCGACCCTGT CGAGGGATCG GTTGACATTA TTCAATGGAT GGCTCTTCTG AAAAGCTGCA GCGCCCTTGA GGCTTTTCGA AAAATTTATC TGTCAAAAAT AGATCCTGAC AATATCCTTC GTTTTCTTAT TCTCGACAGA ACATTTCCCC GAAGCCTCAA CTTCTCGGTC TGTGCCGCTG AAGATGCCTT GTCACGTCTT TCAGGAAGTA CACGACACCG TTTTAACAAC AATGCCGACC GCCTTATCGG CAAACTCGAA GCGGAACTCA GCTACACCAC AATCGAGGAG ATTTACGAGC AAGGGGTTCA CCGCTACCTT GAAGATCTCG AACAACGACT GGTAAAGGTT GGAGAACAGG TTCACCTTAT TTACTTTGCC TATCATACTC CCGAAATTGA GCCGCCGGAT ATCGATGAGG CCCTGCCCTT TACCGGAGTA GCCGGCGGAA GAGCAAACTG GAGCCAGGCA CAGCAACAGC AACAACAGTA G
|
Protein sequence | MLSRVAESLF WMSRYFERAE NTARFLDVNF NLLLDLNQIT AVENPNYWVA LILVTSDREK FNELYSEYNA HTVTDYLVFN KSNPNSISSC VSLARENARS IIETISSEMW EQVNNLYHFL QNSTPMFVHN DPYSFYKEIK NASHLFQGIT DNTFSRNEGW DFVQIGKYLE RADNIARLID VKYHMLLPEY QDSSDPVEGS VDIIQWMALL KSCSALEAFR KIYLSKIDPD NILRFLILDR TFPRSLNFSV CAAEDALSRL SGSTRHRFNN NADRLIGKLE AELSYTTIEE IYEQGVHRYL EDLEQRLVKV GEQVHLIYFA YHTPEIEPPD IDEALPFTGV AGGRANWSQA QQQQQQ
|
| |