Gene Information |
Locus tag | Cpha266_0354 |
Symbol | |
ID | 4569526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 394874 |
End bp | 396850 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 639764952 |
Product | glycosyl transferase family protein |
Protein accession | YP_910837 |
Protein GI | 119356193 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
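
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon (the listed gene sequence ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(394874, 396850)
print(length)                  # 1977
print(protein_length(length))  # 658
```

Both results match the Gene Length (1977 bp) and Protein Length (658 aa) fields of the record.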
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAGAG CTGACCAGAA AGCACAGAAC GATTTTGGCG CAGGGGAATG CCTGGATACA GAGCTTATAA CCAGAAATAA CGACCTGCAG GCACTTCTTT CTGAGCAGGA ACGGAGAATA TATGATCTCC TTCAGCAATA CCACAAAAAC GAGTCAGAAT TACGCTCCGT CTACAATACC CCCATAGGAA AAATTGTCAG ATACTATAAA ACCATCAAGC AAAAAAAGAA AACGCAGAAA AACGCAGAAA AAAACGACTA CAATATCTGG CTGAACAAGT ATGACCTTTT AACTGAAAAG AGACGAAAAA ACATCATTGC AGAAATCGCT ACCAAACGTG AGTCTCCCCT GATTTCGGTA TTCATGCGTC TTAATAAACG ATCACTCTCC TCGCTTGAAC AATCCGTTCA GTCAGTCAAA GCCCAACTTT ACCCACATTG GGAACTCTGC ATCATTGCAG AGGCATCGCA ACATACAGAA GCAGAAACTG CGATCAGAAA ATTTGTGGAG AACGATGCAC GAATAACCCG CCACGTCAAA AAAGAGACCA ACACCATATC GGAGGCAAGC AATGCAGCTC TGGAACTTGC TGGCGGAGAA TTTTTCGCTC TTCTTGAGAG TGGCGACACC ATTCATCCTC TCGCCCTTTA TCATGTTGCA CAAGAGGTCA TGCGCTATCC TGAAGCGGGG TTGCTCTATT CCGATGAGGA CTCATTGGAC AACAACAATA AAAGAGTAAA CCCTTTTTTC AAGCCCGATT TCAATTATGA TCTGTTTCTT TGCCAAAACA TGGTAGGTAA TCTTGCGGTT TTCAAAACAT CATTGGCCAG GCAAACGGGG GGGTTTAAAA GAGAGCTTGA TGGAGCTCAG GATTACGACC TCGCTCTAAG GTTTTATGAA AAACTGAAAC CCGAGCAGAT CCGTCACATT CCCAGAGTGC TTTATCACAA GCGAATTTCC CGATCCGGAA CCATAGCAGC AACAGAGACT CAAATATCCG GCAATCAGGA AGCAGCGCTG CTTGCTGTCA ACCATCATCT CAAACGAACC GGAATAGAGG CAACCGTTGA AAAAGCGCCG GAATATCCGG AATGCAACCG AATACGATAC ACCATCCCCA ACACTCCACC ATCGGTTGAC ATCATCATAC CGACTAAAGA CATGGCGAAC CTTCTGAAAA TCTGTGTTCT GTCAATTCTC GCCAAAACAA CCTATAACAA CTATTCGATA ACCATCATTG ACAATGGGTC AAAAGAGCAG AATACTCTTG ACCTCCTGAA ACAGTGGAAA AATGACTCCC GTATCAGAAT AATACGCGAC GACGAAACAC CATTTAACTA TTCAAAACTT AACAACAGAG CAGTACACTC TTCTTCTGCG GATTTTATCT GCCTGATGAA CAATGATATT GAAATCATAA CGCCTGAATG GCTTAACGAA ATGATGGGGC ATGCAATACA ACCTGGAGTT GGCGCGGTTG GCGCAAGACT CTGGTATCCA AATGCAACGC TACAACACGC TGGTGTCATT ACGGGGATGT ATACAGGAAC TGGCCATGCA CATAAAAAAT ACCCTAAAGG AAATCCGGGA TATTTCGGAC GAGCCTGCCT GCAACAGGAA TATTCAGCCG TTACAGGCGC CTGTCTCTTG ATCAACAGAA TAAATTACCT GCATGTTGCT GGTTTAAATG AACAAGAGCT CACCGTTGCA TTTAACGATA TTGAGCTCTG CCTGAAACTG AAAAAAAAGG GACTGCGAAA TATCTGGACA CCCTATGCGG AAATGTTTCA TCACGAATCT 
TTGACAAGGG GCCGCAACGA CACCCCTGAA AAAAAAGAAC TTGCAGGAAA AGAACTTGCA TACATGCAAA ACACATGGGG AATTGATAAA AATCATGATC CTGCCTACAA CCCGAATCTG TCCATTACAA GCGATGATTT TTCACTGTCC TGGCCTCCTC GAATACTTGA TCGATAA
|
Protein sequence | MIRADQKAQN DFGAGECLDT ELITRNNDLQ ALLSEQERRI YDLLQQYHKN ESELRSVYNT PIGKIVRYYK TIKQKKKTQK NAEKNDYNIW LNKYDLLTEK RRKNIIAEIA TKRESPLISV FMRLNKRSLS SLEQSVQSVK AQLYPHWELC IIAEASQHTE AETAIRKFVE NDARITRHVK KETNTISEAS NAALELAGGE FFALLESGDT IHPLALYHVA QEVMRYPEAG LLYSDEDSLD NNNKRVNPFF KPDFNYDLFL CQNMVGNLAV FKTSLARQTG GFKRELDGAQ DYDLALRFYE KLKPEQIRHI PRVLYHKRIS RSGTIAATET QISGNQEAAL LAVNHHLKRT GIEATVEKAP EYPECNRIRY TIPNTPPSVD IIIPTKDMAN LLKICVLSIL AKTTYNNYSI TIIDNGSKEQ NTLDLLKQWK NDSRIRIIRD DETPFNYSKL NNRAVHSSSA DFICLMNNDI EIITPEWLNE MMGHAIQPGV GAVGARLWYP NATLQHAGVI TGMYTGTGHA HKKYPKGNPG YFGRACLQQE YSAVTGACLL INRINYLHVA GLNEQELTVA FNDIELCLKL KKKGLRNIWT PYAEMFHHES LTRGRNDTPE KKELAGKELA YMQNTWGIDK NHDPAYNPNL SITSDDFSLS WPPRILDR
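
The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be removed before any downstream use. A minimal sketch follows, with hypothetical helper names; it also shows how a GC fraction such as the GC content field could be computed, but it is run here only on the first two blocks of the gene sequence and does not by itself confirm the 44% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence listed above.
first_blocks = "ATGATAAGAG CTGACCAGAA"
print(clean_sequence(first_blocks))  # ATGATAAGAGCTGACCAGAA
print(gc_fraction(first_blocks))     # 0.4
```

Pasting the complete gene sequence into `gc_fraction` would give the value to compare against the GC content field.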