Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2744 |
Symbol | |
ID | 5540230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3547932 |
End bp | 3549638 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640894870 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001432833 |
Protein GI | 156742704 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0287035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCTGG TGATCATCAG CCACGATACG GTAGGTCAGC GAATGGCCGG TCCGGGCATC CGCGCCTGGG AATTGGCGCG GGTGCTGGCG CTCCACGCCG ATGTGATGTT GCTCGCGCCG CAACCTATCG ACCTGGTCGC GCCGGGAGTG CGCACCGGTC ACTTCATTCT GGGCAATAGC GCCTCGCTGG TAGAGTATTT GCGCCAGGCG GATGTCATTC TGGCAAACGG ATTTTTGCTC GAGTCCCACC CGGAACTGGC AGATGCACGA CAACCACTGA TCCTGGACAT GTACGATCCA ACGGTGTTGG AGAATATTGA ACTCTTCCGC GCCGCGTCAC TTCCAGAGCG CCAGGATCGC GCCCGCCGCG ACATTGCGTT GCTCAATCGA CAACTCACGG CTGGCGATCT ATTCCTCTGC GCCACCGAAC GCCAGCGCGA CCTGTACCTG GGCGCACTCA TGGCTGCCGG ACGCATTACC CCTGATCGTG TCGATGCCGA TCCGCTCCTG CACAACCTGG TCACTGTTGT GCCCTTCGGG TTGCCTGCCA CTCCGCCGGT GCGCACCGGT CCCGGCATAC GGGGCGTCAT TCCCGGCATC GGTGAGACGG ATCCGATCAT TCTCTGGAAC AGCGGCATGT GGGACTGGCT GGATCCATTG ACGCTTATCC GCGCAATGAA GCAGGTTGTG ACAGCCATTC CCAACGCGCG CCTGGTTTTC CTGGCGGGCA AACATCCCGG CGGCGCGGCG CCGATGCAAA TGCCCGACGC AGCCCGCGCG TTGGCGTCCG AACTGGATGT CCTCAACCGG CATGTGTTCT TTTATGAGGC CTGGATTCCA TACGCCGACC GCGCGAATAT CCTCCTCGAT GCAACCATGG CAGTGACGCT CCACCGCCAA CACCTCGAAA TGGCATACGC CGCCATTCGC TCGCGGGTGC TCGATTACCT GTGGACCGGG CTGCCGGCTG TGCTCAGCGA CGGCGATCCG GCGGCTGCGC TGGCACGGCA ACACGGGTTT GCGCTGGTGA CCCCGCCGGA AGACCGGGAA GCGGTCGCGC ATGCGATCAT CACCCTCTTG ACCGATGAAG CCAGGCGCCA TGAACTTGCC GCACACGCGC GCGCCCTTGC GCCACGGTAT ACCTGGAACA CCGTTGCACA GCCGATCATC ACGTTTCTCG CTTCCATACC GACTTCCAGG CTGCGCGCCA CCGAACGAAG CGAACAGGTT GACGCGGCGC AACACGTGGA AACAGCGCCG CTGACTGCGC GGCGACAGAC GCTTCAGGCG CAGCGCAATA GCGCATTGCA GGCGCTCGAA GCCACTTGGC GGCTTGATCG GTTGACGCCT CCGGCGCAAG GATTGCCGGG CAAGGCGCGC AACTTCGTTC TGGATCGGAT CGTCTGGCCC TTGACTGCAT CGCTGATTGC CCGCCAACGC GACCACAACG CTGCGGTTAT CCGCGCTGCC TATGCGATGG CGGAATATCA GGATCATCTC TCGAATGACA TCACCCGCCT GATTGCTGCG GTTCGCCTGC TTTCCCATCA GACGCGCGAC ATCATTGAGC ACATCACCGA ACTGCACGAA GCAGACCAGA ACCTGCGCAC CGCGCTCTAT GACGATCCCC CTCCTCCACC GCCGCGCATC ATTCCGCCGA AAACGAATAT CGACACACTA ATGGCAGCGG AGCACCATGA TGAGTAG
|
Protein sequence | MRLVIISHDT VGQRMAGPGI RAWELARVLA LHADVMLLAP QPIDLVAPGV RTGHFILGNS ASLVEYLRQA DVILANGFLL ESHPELADAR QPLILDMYDP TVLENIELFR AASLPERQDR ARRDIALLNR QLTAGDLFLC ATERQRDLYL GALMAAGRIT PDRVDADPLL HNLVTVVPFG LPATPPVRTG PGIRGVIPGI GETDPIILWN SGMWDWLDPL TLIRAMKQVV TAIPNARLVF LAGKHPGGAA PMQMPDAARA LASELDVLNR HVFFYEAWIP YADRANILLD ATMAVTLHRQ HLEMAYAAIR SRVLDYLWTG LPAVLSDGDP AAALARQHGF ALVTPPEDRE AVAHAIITLL TDEARRHELA AHARALAPRY TWNTVAQPII TFLASIPTSR LRATERSEQV DAAQHVETAP LTARRQTLQA QRNSALQALE ATWRLDRLTP PAQGLPGKAR NFVLDRIVWP LTASLIARQR DHNAAVIRAA YAMAEYQDHL SNDITRLIAA VRLLSHQTRD IIEHITELHE ADQNLRTALY DDPPPPPPRI IPPKTNIDTL MAAEHHDE
|
| |