Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1932 |
Symbol | |
ID | 4570046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2239784 |
End bp | 2240863 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639766514 |
Product | glycosyl transferase family protein |
Protein accession | YP_912372 |
Protein GI | 119357728 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.506489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAAC CCGAACGGCT CCGCCCCGCC GTTGACATTA TCATTCCCCA TTTTCGGGGA CTGGAGATGC TTGAACGCTC TCTTGAGTCG CTTGAAAAAA CCCGTTATCC CTCTATGGGA ATTATTGTTG TTGATAACGG AGGCGATCAG GCCGGGCTTG TTTTTCTTGT AAAAAGATTC AGAAATGCAC GGCTCCTGCG ACTCAGGGAA AACAAGGGCT ATGCCGGTGG TTGCAACGAA GGGCTTCGTT TTTCATCGGC TGAATATCTT GTTTTTATGA ACGACGATAC CGAACATGAT CCGTTCTGGC TTGAGCACCT TGTTCAGTCT GCCGAGGCTG ATAAGGGTAT CGGTGCGTTG CAGCCGAAAA TCCTGTCGCT CAAGGCCTGG CGGAAACGGC AAAGGGTTTT CGATTATGCC GGAGCAGCAG GAGGTATGAT AGACCGTTTC GGGTATCCCT GGTGTCTCGG CAGAAATTTC ATGCGCATCG AGCAGGATTC CGGTCAGTTT GACAAGTCGC AGGAGATTTT CTGGGCATCG GGAGTAGCTC TTTTTGCTCG ACGAAGCGTT ATTGAGCAGG TTGGCGGATT TGATGAGCGT TTTTTCATGC ACATGGAAGA GATTGATCTT TGCTGGCGCA TGAAACTTGC CGGATTCAGT ATCAGATCGC AGCCATTGTC GGTTGTTTTT CATGAAGGCG CAGCATCGAT GCCGGAGGGT TCTGCTGAAA AAATTTTCCT CAATCATCGA AACAATATCA CCATGCTTCT GAAAAACCGG GGAGCTCTGT CGCTTTTACC GGTTGTGTCT GTGCGGCTTT TTCTTGAGTT TGCCGCAGCG TTGTTTTATC TCATGCAAGG TTCCGGCGGA GTGAAAAAAT TCCGGGCGGT TTTCAGGGCA CTCCGGCAAA ACGCGCAGTA CCTGCCGGAA ACGCTCAGCA AGCGAAAACT CATACAGGGA TCGAGAAAGA TCAGCGACAG GGAGTTATTC AGAGATATGC CGTTTTCGCT CTTTTTGAGG AAGCTGAATC AATTTACGTT TTTGGTCCGG AAAGGTGGCC CCAGGCGATC ATATCCCTGA
|
Protein sequence | MIEPERLRPA VDIIIPHFRG LEMLERSLES LEKTRYPSMG IIVVDNGGDQ AGLVFLVKRF RNARLLRLRE NKGYAGGCNE GLRFSSAEYL VFMNDDTEHD PFWLEHLVQS AEADKGIGAL QPKILSLKAW RKRQRVFDYA GAAGGMIDRF GYPWCLGRNF MRIEQDSGQF DKSQEIFWAS GVALFARRSV IEQVGGFDER FFMHMEEIDL CWRMKLAGFS IRSQPLSVVF HEGAASMPEG SAEKIFLNHR NNITMLLKNR GALSLLPVVS VRLFLEFAAA LFYLMQGSGG VKKFRAVFRA LRQNAQYLPE TLSKRKLIQG SRKISDRELF RDMPFSLFLR KLNQFTFLVR KGGPRRSYP
|
| |