Gene RoseRS_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1100 
Symbol 
ID5208047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1375091 
End bp1376977 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content56% 
IMG OID640594714 
Productglycosyl transferase family protein 
Protein accessionYP_001275458 
Protein GI148655253 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000283415 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCCCCGCC ATCTTCTGCC GGTCGTTGCC GTAACAGTCG CCGGAACCGG CACCCTGCTG 
GTCGCTGCGC CGGGATGGGT CATTGCCGGC GCCGTCTTCC TCATCTGCGG CGCGTGGTTA
TCCGCACTCG CGCCCGGTCG CTCGTCCGGC GTTCGTCCGG CAGCGCCATC GTCCATACCT
TCCCCCAATC CCTGGCAGCA CGTCGCTCTT GGGAGCATCG TCACCATTGC GCTTGTGAGC
ATACTTGTCC CCGGAATAGC AGATACAGAG AGCCGCTATC ATGTCGATGA GAGCTACTGG
GTTCCGGTGG GTGTTCAGGC ATTTCGAACC GCATTTATCG AGCGCGATCT AGACCATCAG
TTTTGGTTTG ATTATCTCAT GAAATTCGGT TCACCCCATC CGCAGATCGG CAAATATATC
ATCGGCGCCG GAGCATACCT GGCCGGGTAT CACGATGTGC CGTTGCTGCC GTATGACTTT
GGGCAAGACC TGGCGTGGAA CAAGGCGCAC GGACGGGTCT TGCCGCCGGA GATTGTCGGC
GCGGCACGTC TCTCCGTTGC ACTCACCGGC GCGCTGTGCG GGGCGTTCCT CTACTGGCTC
GGCGTCCAGG TCGCCGGACC TGTCACCGGC ATCCTCGCCG TCGTTTTGTT TATTGCAACG
CCAGCCGTAT GGAATCTCGC TCGCCTCGCT ATGCTGGATA TTCCCGCGCT GATGTTCGGA
CTGCTTGCGC TGAATCTGGG TATCCGCGCA GTGACCGCTC TGCGCACGGG GTCCGCCAAC
GCTGGCGCGT GGATTGCAGC CTGCGGCGCC GCCTGCGGCG CTGCCGTCGG CGCTAAACTG
AATGCGTTGC TTATTCCCGG CATCTGCATC CTTGCGATGT GTTTGACCGG CGTTGCGCAC
CAGCACCAAT CTGACAGGTA CACATTGATC TCGGGCGTTG TGTCACTTCT CCTCTGGACG
TGGGTGGTTT TCTTCTTATC CAATCCCATG CTTTACCCCC ATCCCGTCGC TGGCATACAG
CACATGCTTG ACATGAGCAG AATAGCGGCC TCAGGCGAGT TCGCTCCGCT CCCGACGCTG
GCATCGCGCA TCAGCGCAGT CTGGACGAGT CTCGGCGACG GTGGAGGTAT CGGCAGCGGC
GGCTTGCCTG GCAGTCGGCT CTGGCTGATC ATTGGCGCTA TCTTCCTGGC TCGCGCATTC
CTGCAACGGC GACAGGAAGC GCGCTTTTCG GCGCTGTCGG TCATAGCGCT TTGGGGAGGC
ATCAGTTTTG TGGGCATCAC GCTCTGGATT CCTCAGAATG TGAACCGCTA CTACCTGCCA
TTAGCGCCGA TCGCCGCACT GCTCCAGGCA TATGGCATCA TTGAAATCAT CAATGTTTAT
CGGGGTAGTC TGCTTTCTAT TTACTCCAAA ATTGGTTTCT CTCTAGGACT TGTGTTAATT
TCCGTCGCAA TAAACTATTA CGAAACAACC TATCCTGCTG CAATAAATTC TTACGAAATA
AATCACCGCT TTTACCCTGC TGCCGAACTC CCTTCCCAAC TCGGGAGAGT CATTGGTAGC
AGCAGAGAGA TTACCAACGA TATGAAAGCC TCTGGTTTCT TGAGTTACGG TCCGTATGCC
AACTTGCCAC CAGGCAGCTA TGTTGCCATA TTTGAATACA AGAGCGATGC ACGTTCTGAC
ACAAGCATTG GATTGGTTGA TGTTACCGCC GACATGGGAA GGACGGTGAT AACACAGCAA
AAGGTGTACG GAACGAATGG GTCTCCGAGT TCTATTGAAA TTCCATTTAT CCTCCAAGAG
AGACAGAAGA TTGAAGTCAG GTTTTGGTAT GACGGAAATG GAACAGGGTC TACTTCTCTG
CGAAGCCTTA CCATTCGTCC CAGATAG
 
Protein sequence
MPRHLLPVVA VTVAGTGTLL VAAPGWVIAG AVFLICGAWL SALAPGRSSG VRPAAPSSIP 
SPNPWQHVAL GSIVTIALVS ILVPGIADTE SRYHVDESYW VPVGVQAFRT AFIERDLDHQ
FWFDYLMKFG SPHPQIGKYI IGAGAYLAGY HDVPLLPYDF GQDLAWNKAH GRVLPPEIVG
AARLSVALTG ALCGAFLYWL GVQVAGPVTG ILAVVLFIAT PAVWNLARLA MLDIPALMFG
LLALNLGIRA VTALRTGSAN AGAWIAACGA ACGAAVGAKL NALLIPGICI LAMCLTGVAH
QHQSDRYTLI SGVVSLLLWT WVVFFLSNPM LYPHPVAGIQ HMLDMSRIAA SGEFAPLPTL
ASRISAVWTS LGDGGGIGSG GLPGSRLWLI IGAIFLARAF LQRRQEARFS ALSVIALWGG
ISFVGITLWI PQNVNRYYLP LAPIAALLQA YGIIEIINVY RGSLLSIYSK IGFSLGLVLI
SVAINYYETT YPAAINSYEI NHRFYPAAEL PSQLGRVIGS SREITNDMKA SGFLSYGPYA
NLPPGSYVAI FEYKSDARSD TSIGLVDVTA DMGRTVITQQ KVYGTNGSPS SIEIPFILQE
RQKIEVRFWY DGNGTGSTSL RSLTIRPR