Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0675 |
Symbol | |
ID | 4569829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 771476 |
End bp | 772507 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639765273 |
Product | hypothetical protein |
Protein accession | YP_911154 |
Protein GI | 119356510 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR00661] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.575182 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATTC TTTTCGGTGT TCAGGGTACA GGAAACGGCC ATATCAGCCG CAGCAGAGAG TTGGTAAGGC GGCTGAAAGC TGACGGCCAT GAAGTTGACG TTATTATCAG CGGAAGAAAG GAGGATGAAC TCAGAGAGAT AGAGGTTTTT GAGCCCTATC GGGTCATGAA AGGGATGACG CTGGTGACCT TCAAGGGAAG GATGAACTAT ATTGAAACGA TGTTTCAGCT TGATCTTACA CGGTTGATGG CTGATGTTTT CACCCTCGAT ACAGAGGGAA CAGATTTGAT CATAACTGAT TTTGAGCCGA TTACCTCTCT TGCGGCAAGA CTCCGTAACA TTCCCAGTGT AGGATTTGGT CATCAATATG CCTTCAGATT TGATATTCCG GTTGCGAGAG GGAACATTTT TGAAAAATAC ACGTTGCTTA ATTTTGCTCC GGCCCGTTAC AATGCAGGAC TGCACTGGAG TGACTTCAAT CAGCCTGTTT TTCCGCCGGT CATCCCGGAA AGCCTCTACG AACAGAAGCA GCCACTCGTT AACCGCAGTA AAATTCTTGT CTATCTGCCG TTTGAAGAGC TTCCGGATGT TTCGGATTTT GTCTCTCCGT TCGTTGATTT CGAGTTTTTT ATCTATGGTA AAGTGCAAAA TGACAGCGAC GATGGTCATC TGCACTTCAG AGGGTATTCA AGAGCGGGTT TTTTGAAGGA TCTTATGGAG TGCGACGGGG TTGTCTGCAA TGCGGGATTC GAACTTCCCG GAGAAGCTCT CCATCTCGGG AAAAAATTGC TGCTGAGACC TCTCGACGGC CAGATCGAAC AGGAATCGAA CGCGCTTGCC ATGGTGGAGC TTGGATACGG TATGGCGATG CATAGCCTTG ACGGAGACAT GCTGGCCGAC TGGCTTCTGA AACCGGGCAG AGAACCGCTG CATTACGCTA AAACGGTGGA TTATATAGCC GAGTGGATAG GCAGCGGCGA ATGGGATCTT CTTTCTAAAT ATACTGAAGC AGCCTGGAAA AGGCTTTTCT GA
|
Protein sequence | MKILFGVQGT GNGHISRSRE LVRRLKADGH EVDVIISGRK EDELREIEVF EPYRVMKGMT LVTFKGRMNY IETMFQLDLT RLMADVFTLD TEGTDLIITD FEPITSLAAR LRNIPSVGFG HQYAFRFDIP VARGNIFEKY TLLNFAPARY NAGLHWSDFN QPVFPPVIPE SLYEQKQPLV NRSKILVYLP FEELPDVSDF VSPFVDFEFF IYGKVQNDSD DGHLHFRGYS RAGFLKDLME CDGVVCNAGF ELPGEALHLG KKLLLRPLDG QIEQESNALA MVELGYGMAM HSLDGDMLAD WLLKPGREPL HYAKTVDYIA EWIGSGEWDL LSKYTEAAWK RLF
|
| |