Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3369 |
Symbol | |
ID | 5210346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4224735 |
End bp | 4225988 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640596966 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001277679 |
Protein GI | 148657474 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00689338 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCACATCG GTATCGATAT TTCGCTGTTG CGAATTGCTC AGGCGGGCGT TCTCACCTAT CACCGGGCGC TGCTCAACCA CCTGGTGCGC GTGGGACGCG ACTGTCATTT TACGCTGATC GACGTTCTGC CGCTCAACCC TGGTCGCCCA ATGGCGCCGC TGGCAGCGCT CGATGCACCG AACGTGCGGG TGGTGCGTTG CCCGGGCATC CGGCGCGGCT ACCTCAGCGC ACGTCCTGCG TTCCACCACG GACCTGCGCA TACCATCACC GCATGGATCG ACCGTCTGCT CGACCCGATC TGGGCGCAGG CGGCAGTCAT CGAGATGGGA GTGGAACTCC GCGCTGCCAC ACGCCGTGTC GAGGTGTTCC ATGCCTCCGA TCAACTCCCC TTTGCTCCGC CCGGCGCTGC GGTTGTGCTG ACCATTCACG ATCTGACCAC GCGCCGCTTC CCCGACATGC ATATGCCGGA CAATACCGCA CTCCACGCCA TCAAAGAGCG CTTCGCCCGC ACCCGCGCCG ACCGTATCAT CGCCGATTCT GAAGCGACCC GACGCGACAT CGTGCGTGAA CTGAACATCC CGCCAGAAAA AATCAGTGTA GTGTACGCAG CAGCAGATAC GCGCTTCCGT CCCCATACTC CCGAAGAAAC ACAAACGACC CTGGAGCGCT ACAGTCTGAC GCACAACCGG TACATCCTGA GCGTTGGCAC CCTCGAACCG CGAAAAAACC ACGTCCGCCT GATCGAAGCG TATGCGATGC TCCGCGCCAG ATATACGCCA GCCGGACATC TGCCGCCGCT GATCATCGCT GGCGGCAATG GATGGAAGTA CGACGCAATC CTGGCAGCGC CGGAACAGAC CGGCGTCGCC GGGTTCGTCC GGTTTCTTGG GCGCGTCCCC GATGATGACC TGCCCGCGCT GATTGCTGGA TCGCGCGTGT TTGTGTATCC TTCGTTGTAT GAAGGATTCG GGTTGCCGCC GCTGGAAGCG CTCGCCTGCG GTGTACCAGT GGTTGTGTCG CACGCAGCGT CGCTGCCCGA AGTGGTTGGC GACGCCGGGC TATACTGCGA CCCGTATGAC CCGCACGATA TTGCACGCCA GATAGCGGCG CTGCTGGAAG ATAACGACCT GTCGCTGCGA TTGCGGTGTG CTGGCGTTGA ACGTGCGAGG CAGTTCTCGT GGGAGCGCGC CGCCCGTGAG ACGCTCGCCG TCTACGCACA GGCGCGTGAT GAACGAAAGG CGAGACGACG ATGA
|
Protein sequence | MHIGIDISLL RIAQAGVLTY HRALLNHLVR VGRDCHFTLI DVLPLNPGRP MAPLAALDAP NVRVVRCPGI RRGYLSARPA FHHGPAHTIT AWIDRLLDPI WAQAAVIEMG VELRAATRRV EVFHASDQLP FAPPGAAVVL TIHDLTTRRF PDMHMPDNTA LHAIKERFAR TRADRIIADS EATRRDIVRE LNIPPEKISV VYAAADTRFR PHTPEETQTT LERYSLTHNR YILSVGTLEP RKNHVRLIEA YAMLRARYTP AGHLPPLIIA GGNGWKYDAI LAAPEQTGVA GFVRFLGRVP DDDLPALIAG SRVFVYPSLY EGFGLPPLEA LACGVPVVVS HAASLPEVVG DAGLYCDPYD PHDIARQIAA LLEDNDLSLR LRCAGVERAR QFSWERAARE TLAVYAQARD ERKARRR
|
| |