Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0787 |
Symbol | |
ID | 6374454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 842558 |
End bp | 843619 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642683295 |
Product | glycosyl transferase family 2 |
Protein accession | YP_001959219 |
Protein GI | 189499749 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.163096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.605145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGTGA ACAATGAAAT GCCCTCTGTT GAGATAATTA TTCCTCATTA CAGAGGGATC GAGATGCTCG AACGCTGTCT CAACTCCCTT GCTGAGACCT CCTATCCTGC AATGAGCATC TGTATTGTGG ATAACGCAAG CGGCGAGAAA GAGGCGATTT CAGGGCTGAA AGGAACGTTC GAAGCCCTGC GGGTGGTCTC ACTGCCGTCA AATCGAGGCT ATGCGGGAGG ATGCAACAGT GCTCTTTTTT CTTCAACCTC CACCTACGTG GTGTTTCTGA ACGATGATAC CGTTGTCGAT CCATCCTGGC TTTCCTGCCT TGTTAACGCA GCCGAAGAGG ACGGCGGTAT CTCCGCACTT CAGCCGAAAA TCCTTTCATT GCCCGCACAG CGCAGCGGAA AAAAGGTTTT TGATTATGCC GGGGGGGCAG GAGGGCTCAT CGACAGGCTC GGTTATCCCT ACTGCTATGG CCGGACAGGC GCTCACACTG AACAGGATAA CGGTCAGTAT GACCGGGCGG GTGATATTTT CTGGGCTTCA GGTGTTGCCC TCTTCGCACG CAGGGATTGT GTTACGAATC TCGGCGGTTT TGACGAGGAT TTTTTCATGC ATATGGAGGA GATAGATCTC TGCTGGCGTA TGCGCCTTCA GGGTCAGCGA ATCGTTTCGG TACCCTCTGC CGTGGTCTAT CACGAAGGCG GAGCCTCTCT TGCGGAAGGC TCCGCTGAAA AAATATATCT GAACCATCGT AACAATATGG TTATGCTTCT GAAAAACAGG AGCAGTGCCG CGCTCTTCAT CGTTTTTCCT TTGCGTCTGC TTCTCGAATG CGCTGCAGCA GTTCTCTATC TTTCAACCGG AAGGCAGAGG ATACAACGGG CGATAAGTGT ATTTCACGCT TTGTTCGACA ATCTGAGGTG TTTGCCTGAT ATTTTCAGAA AACGTCGAGC GGTTCAAGCC ATGAGGAGGG TCGCTGACCC CGTGATTTTC CGTGACGCCC CGGTATCGAT TGTTCTCGGA AATATGACTT CTTTTCGAAA CGCGCAAAGA AGGACAGCAT AG
|
Protein sequence | MIVNNEMPSV EIIIPHYRGI EMLERCLNSL AETSYPAMSI CIVDNASGEK EAISGLKGTF EALRVVSLPS NRGYAGGCNS ALFSSTSTYV VFLNDDTVVD PSWLSCLVNA AEEDGGISAL QPKILSLPAQ RSGKKVFDYA GGAGGLIDRL GYPYCYGRTG AHTEQDNGQY DRAGDIFWAS GVALFARRDC VTNLGGFDED FFMHMEEIDL CWRMRLQGQR IVSVPSAVVY HEGGASLAEG SAEKIYLNHR NNMVMLLKNR SSAALFIVFP LRLLLECAAA VLYLSTGRQR IQRAISVFHA LFDNLRCLPD IFRKRRAVQA MRRVADPVIF RDAPVSIVLG NMTSFRNAQR RTA
|
| |