Gene RoseRS_0334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0334 
Symbol 
ID5207269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp427570 
End bp429510 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content60% 
IMG OID640593960 
Productglycosyltransferase 
Protein accessionYP_001274716 
Protein GI148654511 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.184181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.020995 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATAG CGGAGCAGAA CGCTGCGAGC GATGCAAGCG TATGTGCAGA ACACCGGGCT 
GCTCGTGATG TTTTGAAGGC GCCGATCATC GCGCTGATGC TTGGTGTGCT TGCACTTGCG
CCGCGTGTCA TCGGGCTTGC CGATTTTCTC ACCACCGACG AAGCGTACCA CTGGATCCGT
TTTACCGAAC GTTTCGATGC AGCAATTTCC GAGGGGCGCT GGGCTGATAC CATTTTCGTC
GGGCATCCCG CCATCACGAT GTTCTGGTTG GGGCGCGCAG GATTGGTGCT CGAGCGCGCT
GCGCGCGATT TGGGCTGGAT AGGCGCCCCT TCGATGATCG AACATCTGGC CTGGCTGCGG
CTGCCGGGGG TGTTCCTGCA GGTGGTGTTT GGGGTAACCA CCTGGATGGT GTTGCGTCGC
CTCGTTGATC CGATGGTCGC GCTGGTTGCG GGATTCCTGT GGTCTACATC GCCATATCTG
ATTGCGCACG GGCGGGTGCT GCATCTCGAT GCACTGCTGA CGGGGTTGCT CACACTGAGC
CTGTTGCTCC TGCTGGTTTC CTTGCGGCAA CAGCAGGCAG GCGCAGGCGG ATGGACAGCG
CTGCTCGGTT CCGGTGCGTT GACCGGACTG GCGCTCCTGA CCAAAGGACC GGCGATCATT
TTTTTGCCAT TTGCCGGTCT GATGCTGTTC GCTCTTGCGC CTGCGAAAGA CGCCTCGAAC
CGACGTGTTT CCGGCGTGGT GTCGGATGTA TTTCGTCGCC TGAGGTATGC GATCGTGCGT
TATGGCGTAT GGCTGGGGGT TGCGCTCGGT GTTGCATTCG CCGGATGGCC CGCGCTGTGG
GTGACGCCGG AAGCGGCACT GCAAGCCTAT GTGGGTGAAA TTATCTTCAA CGGCGGACGT
CCCAACGGCG ATGGGCAGTT CTTCAACGGT CAGGCAGTTG GTGATCCTGG CGTGTGGTTC
TATCCGGTCG CCAGTCTGTT CCGCACGACG TCGGTGATGT TCATTGGTTT GGTCGCTTTT
GTGGTCTTTG CGGTGATCGA TGGCCGCCGC TTCTTCACGC AACGCGATGC CGTCATTCCT
GTCCTGATCG CTTTTGCCGC CTTCTGGACA CTGGTCATGA CGCTGGGTCC AAAGAAGTTC
GACCGATATG TCCTTCCGAT CTGGCCGGTG TTGCTCGTGC TGGCGGCAAC CGGAATCGTG
CGCGGGTACA ATGCTGCGCG GGCATGGTGC ATCCGGCGTG CGATTGTCGT GCCCCGGGGC
GGTGATTTTC TCAAACGCGC GCCTCTGGCG GGGTTGCTGA TAATGGGCGC AATAGAGATC
GGTCAGGTCG TCTGGTACCA TCCCTACTAT CTGAGTTACT ACAATCCCTT GTTCGGCGGC
GGTGCGGCAG CGCAGCGCAT GTTTCTGATC GGATGGGGAG AGGGTATGGA TCAGGTCGGC
GCATGGTTGA GTTCACGCCC TGATATCGGG TACGGACCGG TTATCTCGGC GCTCAGACCA
ACGTTGCAAC CGTTCGTTCC GGTCGATGTT CGTGACATCA CCGATCTGGG GAAACTGCCG
GTCAACTATG CCGTCGTCTA TCTGGAGTCG ATCCAGCGCG GCGCGCATCC TGATATCTAT
CGCCAGTTCG AGCCGATGAC TCCCATCCAT ACAATCACCA TTCATGGCAT CGAATATGCA
AAGATCTACC AGTTGCCGCG CCCATACCGG CAGCCGGTCG GCGCGCGCTT CGGCGATGCA
ATCATGCTCC ACGGCGTCTC AGTCGAATAT GATCAGAACC ATCTGACGGT CACGCCTTCG
TGGGGGGCGC TGGCGCCTCC GCAGGGCGAT TACGTCGTAT TCCTTCAGGT GATCGATGCA
CAGGGACAGC GGGTTGCCGG TGTGGACGTA CCGCCATCCG GCGTTGGGGG GATGCCGACC
GGCGCCTGGC TGCCGGGGTA G
 
Protein sequence
MQIAEQNAAS DASVCAEHRA ARDVLKAPII ALMLGVLALA PRVIGLADFL TTDEAYHWIR 
FTERFDAAIS EGRWADTIFV GHPAITMFWL GRAGLVLERA ARDLGWIGAP SMIEHLAWLR
LPGVFLQVVF GVTTWMVLRR LVDPMVALVA GFLWSTSPYL IAHGRVLHLD ALLTGLLTLS
LLLLLVSLRQ QQAGAGGWTA LLGSGALTGL ALLTKGPAII FLPFAGLMLF ALAPAKDASN
RRVSGVVSDV FRRLRYAIVR YGVWLGVALG VAFAGWPALW VTPEAALQAY VGEIIFNGGR
PNGDGQFFNG QAVGDPGVWF YPVASLFRTT SVMFIGLVAF VVFAVIDGRR FFTQRDAVIP
VLIAFAAFWT LVMTLGPKKF DRYVLPIWPV LLVLAATGIV RGYNAARAWC IRRAIVVPRG
GDFLKRAPLA GLLIMGAIEI GQVVWYHPYY LSYYNPLFGG GAAAQRMFLI GWGEGMDQVG
AWLSSRPDIG YGPVISALRP TLQPFVPVDV RDITDLGKLP VNYAVVYLES IQRGAHPDIY
RQFEPMTPIH TITIHGIEYA KIYQLPRPYR QPVGARFGDA IMLHGVSVEY DQNHLTVTPS
WGALAPPQGD YVVFLQVIDA QGQRVAGVDV PPSGVGGMPT GAWLPG