Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1662 |
Symbol | |
ID | 5539138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2143157 |
End bp | 2144194 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640893799 |
Product | glycosyl transferase family protein |
Protein accession | YP_001431772 |
Protein GI | 156741643 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.363785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00433752 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATGTTC CTTCCATTTC CCTGGTCTGC ACCGTTCGCG ATGAGGCGGA CAATATTGCA GAACTGATCG ATTCGATGCT GGCGCAATCG TTGTTGCCGG ACGAGATTGT GATCAACGAC TGCCAGAGCC GCGATGCAAC GCCACAGATC GTTGCCCGCT ACATTGCATG CGGAGCGCCC ATTCGTCTGG TGCAGGGCGG TCACAATATC CCTTCGGGGC GCAATAACGC CATTCGCCAC GCACGCGGCG CGCTGATCGC CTGTACCGAT GCCGGGCTGC GTCTCGAACC GCACTGGCTG GAACGCATCA CGGCGCCTAT CCGTCGGGGC GATGCCGACG TGGTTGGCGG ATTCTTCCGC CCTGCGCCGC GCAGCCTCTT TGAACTTGTG CTCGGCGCGA CCAACTACCG TGAGGCGGAA GAGATCGATC CGGCGCGCTT TTTGCCCTTT GGGAAATCGT GCGCTTTTCG TCGTGAGGCG TGGGAACGGG TCGGCGGTTA TCCAGAGTGG GCGAATCACT GCGAGGATGT ACTGTTTGCG CTGGCGCTCA AACATCTCGG ATTTCGCTTC GCCTTCGCTC CCGATGCGCT GGTCTTCTTT CGCCCGCGTT CGTCACTCGC CGCTTTTGCG CGGCAGTACT ACTTCTATGC GCGTGGTGAT GGCGTCGCCG GTCTCTGGAC GCTGCGCCAT GCCGTCCGTT ACGCCACGTA TATCACGGCA GCGATGCTGC TGCTGGCGAG CCGGCGGCAT TCGTGGACGC TGGCGCTGAT CGCAGCAGGT GCATGCGCGT ATGTACGCGA TCCGCTGTGG CGTCTACGCC AGCGCGCTCC GGCGCTCGAT GGCGTCAGTA TGTTGCGCGC GGCGTTGCTG ATCCCGGTGA TCCGTCTGGT CGGCGATGTT GCCAAGATGA TCGGCTACCC CGTTGGTGTC GTCCAACGCC TGCGCTCGGC ATCGTTACGA AGAAATGTCG TACATTACCG GAAGAGCGTC ACACTACAAC AGCACGACCT GACCGATGAG CGACACACCG GCGCGTAG
|
Protein sequence | MDVPSISLVC TVRDEADNIA ELIDSMLAQS LLPDEIVIND CQSRDATPQI VARYIACGAP IRLVQGGHNI PSGRNNAIRH ARGALIACTD AGLRLEPHWL ERITAPIRRG DADVVGGFFR PAPRSLFELV LGATNYREAE EIDPARFLPF GKSCAFRREA WERVGGYPEW ANHCEDVLFA LALKHLGFRF AFAPDALVFF RPRSSLAAFA RQYYFYARGD GVAGLWTLRH AVRYATYITA AMLLLASRRH SWTLALIAAG ACAYVRDPLW RLRQRAPALD GVSMLRAALL IPVIRLVGDV AKMIGYPVGV VQRLRSASLR RNVVHYRKSV TLQQHDLTDE RHTGA
|
| |