Gene Information |
Locus tag | RoseRS_1100 |
Symbol | |
ID | 5208047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1375091 |
End bp | 1376977 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640594714 |
Product | glycosyl transferase family protein |
Protein accession | YP_001275458 |
Protein GI | 148655253 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
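The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_cds_record` is hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. Neither assumption is stated in the record itself.

```python
def check_cds_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the length fields of a CDS record (hypothetical helper)."""
    # Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
    span_bp = end_bp - start_bp + 1
    # A complete CDS should divide evenly into codons.
    codons, remainder = divmod(span_bp, 3)
    return {
        "span_bp": span_bp,
        "length_matches": span_bp == gene_length_bp,
        "in_frame": remainder == 0,
        # Assumption: the last codon is a stop codon and is not counted as an amino acid.
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields above.
result = check_cds_record(1375091, 1376977, 1887, 628)
print(result)
```

Under these assumptions the reported values are mutually consistent: the span is 1887 bp, which is 629 codons, or 628 amino acids plus one stop codon.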
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000283415 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGCCCCGCC ATCTTCTGCC GGTCGTTGCC GTAACAGTCG CCGGAACCGG CACCCTGCTG GTCGCTGCGC CGGGATGGGT CATTGCCGGC GCCGTCTTCC TCATCTGCGG CGCGTGGTTA TCCGCACTCG CGCCCGGTCG CTCGTCCGGC GTTCGTCCGG CAGCGCCATC GTCCATACCT TCCCCCAATC CCTGGCAGCA CGTCGCTCTT GGGAGCATCG TCACCATTGC GCTTGTGAGC ATACTTGTCC CCGGAATAGC AGATACAGAG AGCCGCTATC ATGTCGATGA GAGCTACTGG GTTCCGGTGG GTGTTCAGGC ATTTCGAACC GCATTTATCG AGCGCGATCT AGACCATCAG TTTTGGTTTG ATTATCTCAT GAAATTCGGT TCACCCCATC CGCAGATCGG CAAATATATC ATCGGCGCCG GAGCATACCT GGCCGGGTAT CACGATGTGC CGTTGCTGCC GTATGACTTT GGGCAAGACC TGGCGTGGAA CAAGGCGCAC GGACGGGTCT TGCCGCCGGA GATTGTCGGC GCGGCACGTC TCTCCGTTGC ACTCACCGGC GCGCTGTGCG GGGCGTTCCT CTACTGGCTC GGCGTCCAGG TCGCCGGACC TGTCACCGGC ATCCTCGCCG TCGTTTTGTT TATTGCAACG CCAGCCGTAT GGAATCTCGC TCGCCTCGCT ATGCTGGATA TTCCCGCGCT GATGTTCGGA CTGCTTGCGC TGAATCTGGG TATCCGCGCA GTGACCGCTC TGCGCACGGG GTCCGCCAAC GCTGGCGCGT GGATTGCAGC CTGCGGCGCC GCCTGCGGCG CTGCCGTCGG CGCTAAACTG AATGCGTTGC TTATTCCCGG CATCTGCATC CTTGCGATGT GTTTGACCGG CGTTGCGCAC CAGCACCAAT CTGACAGGTA CACATTGATC TCGGGCGTTG TGTCACTTCT CCTCTGGACG TGGGTGGTTT TCTTCTTATC CAATCCCATG CTTTACCCCC ATCCCGTCGC TGGCATACAG CACATGCTTG ACATGAGCAG AATAGCGGCC TCAGGCGAGT TCGCTCCGCT CCCGACGCTG GCATCGCGCA TCAGCGCAGT CTGGACGAGT CTCGGCGACG GTGGAGGTAT CGGCAGCGGC GGCTTGCCTG GCAGTCGGCT CTGGCTGATC ATTGGCGCTA TCTTCCTGGC TCGCGCATTC CTGCAACGGC GACAGGAAGC GCGCTTTTCG GCGCTGTCGG TCATAGCGCT TTGGGGAGGC ATCAGTTTTG TGGGCATCAC GCTCTGGATT CCTCAGAATG TGAACCGCTA CTACCTGCCA TTAGCGCCGA TCGCCGCACT GCTCCAGGCA TATGGCATCA TTGAAATCAT CAATGTTTAT CGGGGTAGTC TGCTTTCTAT TTACTCCAAA ATTGGTTTCT CTCTAGGACT TGTGTTAATT TCCGTCGCAA TAAACTATTA CGAAACAACC TATCCTGCTG CAATAAATTC TTACGAAATA AATCACCGCT TTTACCCTGC TGCCGAACTC CCTTCCCAAC TCGGGAGAGT CATTGGTAGC AGCAGAGAGA TTACCAACGA TATGAAAGCC TCTGGTTTCT TGAGTTACGG TCCGTATGCC AACTTGCCAC CAGGCAGCTA TGTTGCCATA TTTGAATACA AGAGCGATGC ACGTTCTGAC ACAAGCATTG GATTGGTTGA TGTTACCGCC GACATGGGAA GGACGGTGAT AACACAGCAA AAGGTGTACG GAACGAATGG GTCTCCGAGT TCTATTGAAA TTCCATTTAT CCTCCAAGAG 
AGACAGAAGA TTGAAGTCAG GTTTTGGTAT GACGGAAATG GAACAGGGTC TACTTCTCTG CGAAGCCTTA CCATTCGTCC CAGATAG
|
Protein sequence | MPRHLLPVVA VTVAGTGTLL VAAPGWVIAG AVFLICGAWL SALAPGRSSG VRPAAPSSIP SPNPWQHVAL GSIVTIALVS ILVPGIADTE SRYHVDESYW VPVGVQAFRT AFIERDLDHQ FWFDYLMKFG SPHPQIGKYI IGAGAYLAGY HDVPLLPYDF GQDLAWNKAH GRVLPPEIVG AARLSVALTG ALCGAFLYWL GVQVAGPVTG ILAVVLFIAT PAVWNLARLA MLDIPALMFG LLALNLGIRA VTALRTGSAN AGAWIAACGA ACGAAVGAKL NALLIPGICI LAMCLTGVAH QHQSDRYTLI SGVVSLLLWT WVVFFLSNPM LYPHPVAGIQ HMLDMSRIAA SGEFAPLPTL ASRISAVWTS LGDGGGIGSG GLPGSRLWLI IGAIFLARAF LQRRQEARFS ALSVIALWGG ISFVGITLWI PQNVNRYYLP LAPIAALLQA YGIIEIINVY RGSLLSIYSK IGFSLGLVLI SVAINYYETT YPAAINSYEI NHRFYPAAEL PSQLGRVIGS SREITNDMKA SGFLSYGPYA NLPPGSYVAI FEYKSDARSD TSIGLVDVTA DMGRTVITQQ KVYGTNGSPS SIEIPFILQE RQKIEVRFWY DGNGTGSTSL RSLTIRPR
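The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the gene sequence is already given in coding orientation, which is consistent with it beginning with ATG and ending with TAG.

```python
from itertools import product

# Codon order follows the usual NCBI layout: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGCCCCGCC ATCTTCTGCC GGTCGTTGCC"))  # MPRHLLPVVA
print(translate("CGAAGCCTTA CCATTCGTCC CAGATAG"))     # RSLTIRPR
```

The two fragments translate to the first ten and last eight residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.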