Gene Information |
Locus tag | Rcas_3645 |
Symbol | |
ID | 5541147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4769672 |
End bp | 4770832 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640895765 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001433712 |
Protein GI | 156743583 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
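The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 4769672, 4770832
length = gene_length_bp(start_bp, end_bp)
print(length)                              # 1161
print(expected_protein_length_aa(length))  # 386
```

Both results agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields. The strand ("-") does not affect the length calculation, only the orientation of the sequence.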
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000243926 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGACTC GACTGCGCCA ACGCCTGCCT GCCCTGCTGC GCGCCGTTGC CGCTCTGCCG CGAACAGTCA CGGCGCCGCG CCGCTCACCG CCGCGCATCC GGTTTGTCTA CCTTCATCCT GGCGCTGCAA CGCGCTACCG CGTCTGGCAT CAGGTGGAGC AGGCGCAGAT CGCCGGTCTG GCGGCGGATG CCGTTGCGCT GCACGACTCG GCGCGACTCT ACGATCTGTC GCAGGTCGAT CTGCTGATTG TGCATCGTCT GCCGCTGGCG GCGCTGACGT TGCCGCTTGT CGTTGCCGCC CGGTTGCGTC GCATTCCGCT GGTGTTCGAT AGCGACGACC TGGTGTGGGA TGAGCGTGAG CGCGAGTACA ATTTTCTCGA CCGTCATCAC GATCCGGTGA CGATTGCCCG CCTCCTGCGC GCGGCGCGCG GTATGCGGCG ATTGATGCGC CTGTCCGATG CGTTGATCCT TTCGACGCCA TTCCTTGCGG CGCTGGCATC CGCCGATGTT CGCCGTCCAT CGTTCGTCAG CCCGAATGTG CTGTCGCGCG AACAGGTTGC GCTGTCGCGC GTCGCGTTCG ATGAGCGACA GCAACGCCCG TTGCGCGCAC AACCGGTTAT CGGGTACTTT TGCGGGCATG CGCATGTTCA CGATGAGGAT ATTGCGACCG TCGGCGCTGC CCTGCGCATG GCGCTGGAGG AATGTTCTGA AGCGCGATTG CGCTGCTACG GCGAGGTGAC ATTACCGCCG GAACTGACAG ATGTGTCGGT GTGTGACCGC ATCGAGCGGC GTCCGGTGGT CGACTGGCGC GATCTACCGC GCCACATCGC GGCGGTTGAC ATTAATATCG CGCCGCTGGT CGATAACCCG CAGCGCCGTG GCAAGAGCGC CGTCAAATAT CTGGAAGCCG CGGCGGTAGG TGTGCCGACG GTTGCGGTTC GGCTCGAACC ATACCGGGAT GCGATTGATG AGGGTGTGAC AGGGGTGCTG GCAGCCACGC GCGACGAATG GGTTTCGGCG CTGATACGGT TGCTGCGCGA CCCGGAGTTG CGGCGGCGGA TGGGTGAGGC GGCGCGCGCT GATGTGCTTG CCCGCTTCAC AACGGAGCGT CAGGCAGAAC GATTTGCGCG GATGATCGGA GCGATTGGGA GAAGCGCGTG A
|
Protein sequence | MMTRLRQRLP ALLRAVAALP RTVTAPRRSP PRIRFVYLHP GAATRYRVWH QVEQAQIAGL AADAVALHDS ARLYDLSQVD LLIVHRLPLA ALTLPLVVAA RLRRIPLVFD SDDLVWDERE REYNFLDRHH DPVTIARLLR AARGMRRLMR LSDALILSTP FLAALASADV RRPSFVSPNV LSREQVALSR VAFDERQQRP LRAQPVIGYF CGHAHVHDED IATVGAALRM ALEECSEARL RCYGEVTLPP ELTDVSVCDR IERRPVVDWR DLPRHIAAVD INIAPLVDNP QRRGKSAVKY LEAAAVGVPT VAVRLEPYRD AIDEGVTGVL AATRDEWVSA LIRLLRDPEL RRRMGEAARA DVLARFTTER QAERFARMIG AIGRSA
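The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be stripped before any analysis. The following is a minimal sketch of how that could be done, together with a GC fraction calculation comparable to the GC content field. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the display whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence above (illustration only).
first_blocks = "ATGATGACTC GACTGCGCCA"
print(clean_sequence(first_blocks))       # ATGATGACTCGACTGCGCCA
print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.55
```

Passing the complete gene sequence to `gc_fraction` should give a value that can be compared with the GC content field of the record. The cleaned sequence length can likewise be compared with the Gene Length field.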