Gene Information |
Locus tag | Cpha266_0690 |
Symbol | |
ID | 4569844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 788344 |
End bp | 789888 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639765288 |
Product | glycosyl transferase family protein |
Protein accession | YP_911169 |
Protein GI | 119356525 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
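The coordinates and lengths in the table above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming exactly one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(788344, 789888)
print(length, protein_length_aa(length))
```

With the values from this record, the computed gene length matches the listed 1545 bp and the computed protein length matches the listed 514 aa.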
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCCGG AGAGACAAAA CTATAACAAA ATTATCCATG CCTCGATAGT AGGGGTTCTG GTCATTGCAA GTTTTTTTTA CGGTCTCGGT TCCGGGCCGC TCTTTGATGT TGACGAAGGG GCATTCAGCG AGGCAACCCG GGAAATGCTT GCGACCAAAA ACTATCTGAC CACCTACCTC AATGGAGTGC CACGATTTGA CAAGCCTATT CTAATCTACT GGTTCCAGGC GCTTTCGGTA ACCCTTTTCG GAATGACCGA ATTCGCTTTC AGGCTCCCCT CTGCGCTTGC CAGTGCGATC TGGGCTGGAG CTATCTATCT TTTTGCCCGA AAGGAACTCG GTTCCCGCAG GGCGTTTCTG GCCGCTGCGC TCATGATTCT CTCCCTGCAG GTGACCATCA TTGCCAAGGC CGCCATTGCC GATGCTCTTT TGAACTGCTT TCTTGCCGTC AGCATGTTTG CTGTATACCG GCATCTGACT ACCGGATCGA AACCTGCCAG AAACCTTGCC TTTGCGGCTA TCGGGCTGGG TGTTCTGACC AAGGGTCCGA TAGCGATGCT TATTCCTCTT GCTGTTTCCT TTCTTTTCTC GCTTCAGCAG AGATCACTGA AAAAATGGCT CACGACCGTT CTGAACCCGA CAGGCATCAT CATTTTTCTT CTTATTGTGA TGCCATGGTA CACGCTCGAA TACCTCGATC AGGGCATGGC CTTTATTCAG GGGTTTTTTT TCAAACATAA TATCAACCGT TTCAACTCCT CGCTCGAAGG GCATTCCGGA TCTCTCTTCT ACTATTTTCC GGTCATTCTC CTGGGACTTA TGCCATTTAC CGGCCTCCTG TTCACCACGC TCTTCAACCT GAAAAAACTG CTTGCCGAGC CGCTCAACAT CTTTCTTGCA ATCTGGTTCG GCTTTGTATT CATCTTTTTT TCGCTTTCCG GCACCAAACT GCCGCACTAT ATGATCTACG GATATACGCC TCTTTTTCTC CTGATGGCAA GAGTATTCGA TTCCGGCAAA CACCCGAAAC TGCTGGTAGT ATGGCCTCTG CTCTTTATCA CCCTCCTGGG GGCCCTCCCG TATGCTCTCC CCATGGCAAT GGAACGGATT GATGATGCCT ATATCACTGC CATCCTCCAT GATGCCCGTA TCATTCTTGA TAGTACGTAT CTCATGGTCA TCGGGGTAAG CGGGCTGCTC CTTGTGGCCA CGATAACGAT CCCTGTGCTG AAAGCTTCAG GAAGGCTGAT CGCGCAGGGG GTTGTTCTGA CGCTGCTGGT GAATCTCTTC CTGATGCCCA TTGCAGCCGA ACTCCTGCAG ATACCGGTCA GGGAAGCGGC AATTCTTGCC AGAAAGGAGG GGTATAAAGT TGTGATGTGG AAAGTCTATT ACCCCTCTTT TTTCGTATAT TCCGAAACTT TTGCAGAGAG ACGAGCTCCC GAAAAGGGCG ATATCGTGCT TACAACCGTC AAATACCTTC CAGGGTTTGA CAATCCTGAT GTACTCTACC AGAAGCACGG CATTGTACTG GTAAACAATA AATAA
|
Protein sequence | MLPERQNYNK IIHASIVGVL VIASFFYGLG SGPLFDVDEG AFSEATREML ATKNYLTTYL NGVPRFDKPI LIYWFQALSV TLFGMTEFAF RLPSALASAI WAGAIYLFAR KELGSRRAFL AAALMILSLQ VTIIAKAAIA DALLNCFLAV SMFAVYRHLT TGSKPARNLA FAAIGLGVLT KGPIAMLIPL AVSFLFSLQQ RSLKKWLTTV LNPTGIIIFL LIVMPWYTLE YLDQGMAFIQ GFFFKHNINR FNSSLEGHSG SLFYYFPVIL LGLMPFTGLL FTTLFNLKKL LAEPLNIFLA IWFGFVFIFF SLSGTKLPHY MIYGYTPLFL LMARVFDSGK HPKLLVVWPL LFITLLGALP YALPMAMERI DDAYITAILH DARIILDSTY LMVIGVSGLL LVATITIPVL KASGRLIAQG VVLTLLVNLF LMPIAAELLQ IPVREAAILA RKEGYKVVMW KVYYPSFFVY SETFAERRAP EKGDIVLTTV KYLPGFDNPD VLYQKHGIVL VNNK
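The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, compute a GC fraction, and run a rough completeness check on a CDS. The helper names are hypothetical, and the check assumes an ATG start codon (as in this record) even though translation table 11 permits other start codons.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """Rough check: length divisible by 3, ATG start (assumed), stop codon at end."""
    return len(dna) % 3 == 0 and dna[:3] == "ATG" and dna[-3:] in STOP_CODONS


# First two blocks of the gene sequence above, used here as a small example.
fragment = clean_sequence("ATGCTGCCGG AGAGACAAAA")
print(fragment, gc_fraction(fragment))
```

Applied to the full gene sequence, the same helpers can be used to compare the result against the listed gene length and GC content.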
|
| |