Gene Rcas_0849 details

Gene Information

Locus tag: Rcas_0849
Symbol:
ID: 5538315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1111042
End bp: 1113195
Gene Length: 2154 bp
Protein Length: 717 aa
Translation table: 11
GC content: 62%
IMG OID: 640893001
Product: glycosyltransferase
Protein accession: YP_001430984
Protein GI: 156740855
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
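
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1111042
END_BP = 1113195
REPORTED_GENE_LENGTH = 2154      # bp
REPORTED_PROTEIN_LENGTH = 717    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under those assumptions both comparisons hold: 1113195 - 1111042 + 1 = 2154 bp, and 2154 / 3 - 1 = 717 aa.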


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0226156
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCGAG TCGAACGAAA CCCTGCGCGC GGCGCGAGCG TATTCCTGGC GCCACAAAGC 
GCGCTAACGA TCGGCAGAAC ACTGCTGACT GCCCTGATAC TTGGCACGCT CATGCTCGCG
CCGCGCGCGT TCGGGCTTGC CGATTTTCTA ACGACCGACG AAGCGTACCA CTGGATTCGG
TTCACCGAAC GATTCGATGC TGCGCTCTCT GCCGGTCGCT GGGCAGATAC TATCTACGTC
GGGCATCCCG GCATCACCAT GTTCTGGTTG GGGCGCACCG GGTTAATGAT CGAGCGCATG
GTGCGCGATC TGGGCTGGAT CGGAGCGCCT GCCATGGTCG AGCATCTCGC CTGGTTGCGG
CTGCCGGGCG TGATCCTTCA GGCGCTGTGC GGCATGGCAA CATGGCTCTT GTTGCGCCGC
CTTGTCGATC CGACCGTCGC GCTGGTCGCG TCGTTCCTGT GGGCTACGTC GCCGTATCTG
ATCGCTCATG GTCGGGTCCT CCATCTGGAC GCGCTTCTGA CCGGACTGAT CACGCTGAGC
CTGCTCTTGC TGCTGGTTTC CTGGCGACAA CAACAGGCGG GCAAAGATGG ATGGACCGCG
CTGCTCGGTT CTGGAGCGCT GACCGGACTT GCGTTGCTGA CGAAGGGACC GGCGCTCATT
TTCCTGCCAT TCGCCGGTGT ACTCCTCTTT GCGTTCGCTC CCGCATATAC GGCGACATCC
CGGCATACAG GCGACATACT GCGCCTGGCA TCGGGCATCG CCCATCGCCT GCGGTATGCT
GCTGTGCGCT TCTGCGTATG GTTGGGCATA GCGATCTGTG TGGTCTTCGC GGGATGGCCC
GCGCTCTGGA CGGCGCCGGA CGCAGCGCTT CGCGCATACG CCGATGAAAT CATCTTCAAT
GGCGGACGCC CCAATGGTGA TGGGCAATTT TTCAACGGTC AGGCAATCGA CGATCCCGGC
GTATGGTTCT ACCCGGTCGC CAGCCTGTTT CGCACAACGC CGGTTATGCT GATCGGGTTG
ATCGTGTTCG GCCTCTTCGC CGGTCTGGAC GCCTGGCGTT TCTGGAGGCG AGGCGAAGCG
CCCTTCGATA GCAGATACCG CGTACTTGTC GTCTTCATCG CATTTGCCGC ATTCTGGACC
TTCATGATGA CCCTGGGCGC GAAGAAGTTC GACCGGTATG TCTTGCCCAT CTGGCCCTCA
CTGCTGGTCC TGGCAGCGAC CGGAATCGTG CGCGGGTATG GTGCTGCGCG GGCATGGTTC
GCCCGCCGTG CAACGACGAT ACAACGCGGC GGCGCGTGGC TCGCGCGTAT CCCGCTGGCG
ACGTTGATCG CGCTGGGCGG CGTTGAAGTT GGTCAGGTTA TCTGGTATCA CCCGTACTAC
CTGAGTTACT ACAATCCGCT CCTGGGTGGC GGTCCCGTCG CGCAGCGCAT GGTGCTTATC
GGATGGGGTG AAGGCATGGA TCAGGTCGGA GCGTGGTTGA GCGCCCGCCC TGATATTCGA
TACGGACCGG TGATCTCGGC ATTGCGGCCA ACGCTGCAAC CATTCGTGCC CGTCGATGTC
CGCGACATTA CCGACCTGGG AACATTGCCG GTCAATTACG CCGTTGTGTA CCTGGAGTCG
GTGCAACGGG GCGCGCATCC CGAGATTTAC CGGCAGTTCG AAGCGATGAC GCCGATTCAC
ACGATCACCA TTCACGGGAT CGAGTACGCC AGGATCTATC AGTTGCCGCG TCCATTCGCG
CAACCAATCC ATGCGTGTTT CGGTGATGAG ATTACATTGC ACGGCGTCAC GATCGAAGCC
TCGCCTGATC ATCTGTCGGT GACGCCCTCC TGGGGAGCGC TCGTATCTCC GACGCGCGAT
TACATGGTGT TTCTCCAGAT GATCGACGTA CAGGGGCGGC GTGTCGCCGG TGTTGATGTG
CCGCCCGCCG GAGTTGGCGG ACTGCCTGCG AGCGCATGGC TTGCGGGTCA GCAAGTGGCA
GTGCCGCTGC CGCTGCCGTT GCCGTCTGAC CTGCCGGCCG GAACATATGA GGTCGTCATC
GGTTTGTACG ATGCCAACAG CGGCGAGCGT GCGCCCGTCA GCGGCGGCGT CGCAGCCGAC
CCGGCGCGCG CCGGGCCGCA TGCGCTGCTT TTGACGACGC TGACGCTGCC GTAA
 
Protein sequence
MQRVERNPAR GASVFLAPQS ALTIGRTLLT ALILGTLMLA PRAFGLADFL TTDEAYHWIR 
FTERFDAALS AGRWADTIYV GHPGITMFWL GRTGLMIERM VRDLGWIGAP AMVEHLAWLR
LPGVILQALC GMATWLLLRR LVDPTVALVA SFLWATSPYL IAHGRVLHLD ALLTGLITLS
LLLLLVSWRQ QQAGKDGWTA LLGSGALTGL ALLTKGPALI FLPFAGVLLF AFAPAYTATS
RHTGDILRLA SGIAHRLRYA AVRFCVWLGI AICVVFAGWP ALWTAPDAAL RAYADEIIFN
GGRPNGDGQF FNGQAIDDPG VWFYPVASLF RTTPVMLIGL IVFGLFAGLD AWRFWRRGEA
PFDSRYRVLV VFIAFAAFWT FMMTLGAKKF DRYVLPIWPS LLVLAATGIV RGYGAARAWF
ARRATTIQRG GAWLARIPLA TLIALGGVEV GQVIWYHPYY LSYYNPLLGG GPVAQRMVLI
GWGEGMDQVG AWLSARPDIR YGPVISALRP TLQPFVPVDV RDITDLGTLP VNYAVVYLES
VQRGAHPEIY RQFEAMTPIH TITIHGIEYA RIYQLPRPFA QPIHACFGDE ITLHGVTIEA
SPDHLSVTPS WGALVSPTRD YMVFLQMIDV QGRRVAGVDV PPAGVGGLPA SAWLAGQQVA
VPLPLPLPSD LPAGTYEVVI GLYDANSGER APVSGGVAAD PARAGPHALL LTTLTLP
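
The sequences above are laid out in blocks of 10 characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction from the joined string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCAGCGAG TCGAACGAAA CCCTGCGCGC GGCGCGAGCG TATTCCTGGC GCCACAAAGC"
seq = clean_sequence(first_line)

print(len(seq))   # six blocks of 10 bases
print(seq[:3])    # start codon
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field in the record.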