Gene Information |
Locus tag | Rcas_3644 |
Symbol | |
ID | 5541146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4767574 |
End bp | 4769010 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640895764 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001433711 |
Protein GI | 156743582 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
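The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (an assumption, though it is consistent with the reported 1437 bp) and that the final codon is a stop codon that contributes no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4767574
end_bp = 4769010

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases. The last codon is assumed
# to be the stop codon, which does not add an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1437 0 478
```

A remainder of 0 indicates the gene length is a whole number of codons, which is what one would expect for an unspliced CDS that is not a pseudogene.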
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0102527 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000773484 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCACATTC TGTTGATAAC GCCCTGGCTG ACGATTGGCG GCGCGGATCG AGTCCATCTT GACCTGATTC GTCAGTTGAA CCGGCGCGGC TGTCGGTTCA GCGTGGTGGC GACATTGCCG GCGAAGCACG AATGGCGCCC ATTGTTCGAG GAACTGACGC CTGATGTCGT CACACTGCAT CCGACCATTG CGCCAGCACA GCAACCGGCG TTCGCGCGCG ACCTGATCAG GTCGCGCGGC ATTCATGCAG TTCTCATCGG CAACAGTCAG TTCGGATATG CGTTGCTGCC GTATCTGCGG TCCTGTTGCC GGGATGTGGC GTTCCTTGAC ATACTGCACG CAGTAGAACC ACACTGGCGA GACGGCGGGT ATCCGCGCCT GTCGCTCGAC CATGCCGCCT GGCTCGATCT GAGCATCACC GTTTCACGCG ACTTGCGCGA CTGGATGATC GCGCGCGGCG GTGACCCGGC GCACATCGAG GTCTGCTATG CCAACATCGA TGTTGACGCA TGGAATCCGG CGCTTTTTGA CCGCGCAGCG TTGCGGCGAG CGTTCGGCAT CCCGCCGCGC GCGCCGCTGA TCCTGGTGAT CGGGCGATTG TCGTCGGAAA AGCGTCCACG TCTGGCGGTG CGCATCCTGC GGGAAGTGGC GCGCCAGGGT ATCGCCTTTC ATGCGCTGAT TATCGGCGAT GGACCGGAGC GCCCGGTGTT GGAGCGGATG CTGCGCGACC CATTGCTCCA GAACGTCCGC TTGACCGGCG CGTTGCCCGA AGAGCGGGTG CGGGAGGTTA TGGCAGCCGG GGACGTGCTG CTGCTACCCT CGGCGCGGGA GGGGATTGCG CTGGTGTTGT ACGAAGCGAT GGCGATGGGG ATGGTTCCGG TGGTCGCCGA TGTCGGCGGG CAGCGTGAAC TGGCAACACC CGACTGTGGC ATGCTTATTC CATCATCCAA GAGCGAAGAA GCGGCATACG TTGCGGCACT CGCCGGTCTG TTGCGCGATC CGGCGCGTCG CGCGGCGATG GGTGCTCAGG CGCGACGGCG GATCGTTGAC CATTTTCGGA TCGATCAAAT GGGTGATCGC ATGGAGATGC TGTTCAGGCG CGCGGTTGAG CGCGCGATGG GCGCCGAGCG TCCAATTCCG ACGGAAGGCG ACGCGGCGCG CAGCGCGGTC GAGGCGATCC GCCTTGCGCG TCACGCAAGG GATGTGGCGC GTTTGTGGCA AGCCGACAGG TATTCTGAGG ATGCGTCCCT CTCACCTGTG CGCCGCGCCG TGTTGGGTGT TGTGCGCAGT ATGCGGCAAC GGTTGCGCCC ATGGTACCGA CGTCTGGTCG GTCGTGATGG ACATCCCTTC AGCCGTGGAG TTGTTGCAGT GCGTGATCGG GTAGTGGCGT GGGTGTATGA CGAAGGGCGA GAGGCCCCCC CCGGTAGGCG GGTTTAA
Protein sequence | MHILLITPWL TIGGADRVHL DLIRQLNRRG CRFSVVATLP AKHEWRPLFE ELTPDVVTLH PTIAPAQQPA FARDLIRSRG IHAVLIGNSQ FGYALLPYLR SCCRDVAFLD ILHAVEPHWR DGGYPRLSLD HAAWLDLSIT VSRDLRDWMI ARGGDPAHIE VCYANIDVDA WNPALFDRAA LRRAFGIPPR APLILVIGRL SSEKRPRLAV RILREVARQG IAFHALIIGD GPERPVLERM LRDPLLQNVR LTGALPEERV REVMAAGDVL LLPSAREGIA LVLYEAMAMG MVPVVADVGG QRELATPDCG MLIPSSKSEE AAYVAALAGL LRDPARRAAM GAQARRRIVD HFRIDQMGDR MEMLFRRAVE RAMGAERPIP TEGDAARSAV EAIRLARHAR DVARLWQADR YSEDASLSPV RRAVLGVVRS MRQRLRPWYR RLVGRDGHPF SRGVVAVRDR VVAWVYDEGR EAPPGRRV
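The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short sketch. It is a minimal example, not the database's own pipeline: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares for internal codons. Table 11 differs mainly in which codons may act as start codons; that does not matter here because the displayed sequence begins with ATG. The displayed sequence also ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand.

```python
from itertools import product

# Codon order follows the usual T, C, A, G convention (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGCACATTC TGTTGATAAC GCCCTGGCTG"
print(translate(prefix))  # MHILLITPWL
```

Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein sequence fields to be checked against the record in the same way.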
| |