Gene Rcas_0469 details


Gene Information

Locus tag: Rcas_0469
Symbol:
ID: 5537932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 601840
End bp: 603414
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 61%
IMG OID: 640892632
Product: glycosyltransferase
Protein accession: YP_001430618
Protein GI: 156740489
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
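
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 601840
end_bp = 603414

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1575
print(protein_length_aa)  # 524
```

Both results agree with the Gene Length (1575 bp) and Protein Length (524 aa) fields.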


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.171688
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.657168
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCGTC CAGAAGCGCA CAGCGCTCCA CATCTTCAAA CAACGCCAAT GACCGTTGGC 
GCCTCTCGGA TTCTTGTGGC AACGCTGTTT ATCGTGGCAG TGTTGCTGGC GACGATCAAT
CTGCCATATG CGCCGCGGAC CTGGTTTGAT GAGGGATCGC ACCTGCACGT GCCGAAAGCA
TTGGTGCAGT ACGGCAAGTA CGCCGACATC AGCGCCATCC CTGATGGACG CATCGAGTTT
CGCTACCACG GACCCACGAT TGGTATCGGT CCGACCATTA TGCTGCCGGT TGCGGCGGTC
TACCAGGTGT TCGGTGTCGG TCTGACGCAA GGGCGACTGG TAATTGTGAT CTATTTTGCC
ATTGCGCTTG TTGCCGGATA TGCGCTTGCG CGTCGTCTGT ATGATCGCCA GACTGCACTG
ATCGCGCTGG CGCTCCTGCT GGCGTCACGC ACGGTCAATT ATGAGGGGCT GATCGAGTAT
GGGAGGCAGG TGCTCGGCGA GGCGCCCGGC GTGGCATTCG TCTTTCTGGG AATGCTGGCG
TGGCTGACGG CGTTGAAGAC AGCAGCGCAA CCGGCGCTGC GGCATGCGCA CCGGACGTGG
AGCATACTGG CGGGGCTGGG GTTTGGTCTG GCGTTGGTCA CCAAGAACCA ATTTGTGCTG
ATTATCCCGC CGGCGCTGGC GCTGACAGCG TTGCTCGACT GGCGCTACTA TCGGGCGGGA
ACCTGGACGC TGCGCCTGAT TCCGCCGATT GTTGCTGTTG GTTGTTTTGC GCTCTGGACG
GTCGTGCAGT TTGCGCTGCT CGGTCCTGGC ACATTCTTTG AAAATCTTCA GCAAACCCGG
CAGGCTGCTG GCGGGGCGAT TTTCGTTTTC AACCTGCGTT CGACCCTGCG CGCCGGGTAT
TACCTGTTGC GTCCCGATCT GTTCGGCGGG CTGGTTGTGC CGGCGCTGGC ATACACTATC
TGGCGCGCGC GCCGCCGCAC ATCGCAGGGG TTGAACGAAG CGCTGCTGGC ACTGATCATT
GGTCTCTGGC TGGCGTGGTT CGTCGGCGTT TCCCTCGGCT GGCCCCGCTA CGCCTTTCCG
GCAGTCGCAC TGAGCGCTCT GACCGTCGCG CGGCTGGCAT TCGATACGAT TGTCTGGCTG
CGCCGCGTGT TGCCGGCGGC AGCAACAATT GCCGCCATCT ACCTGGTTGT CATCATCGTG
CTGCCGATGG CACTAACAGT GCGCGTGGTG TTCACGCCCG ATGATAGTGC ACAGCGGTTC
GCCGCATATC TGAATGCGAA TGTGCCTGAA TCGGCGATCA TTGCTACCTG GGAGCCGGAA
TTGGGGGTGC TGACCGATCA CCGCTATCTC TACCCGCCCC AACCGACCCT GGATCAGGCA
GTGCGGCACA CCTGGCTGGG AGGTGATCCG GTGCGCTACG ACTGGTACGC AGATCGACCG
GAGTATGTTG TGGTCGGCAG TTTCGGCGGT TATACCGGCG TGTACCATAC GCCCGAACTG
GAACGCCACT ATATTCGTGT GGCGCAGATG GGTACGTATG CGTTGTACCA GGTTCGGGCG
GGGAGTGGGG AGTAG
 
Protein sequence
MIRPEAHSAP HLQTTPMTVG ASRILVATLF IVAVLLATIN LPYAPRTWFD EGSHLHVPKA 
LVQYGKYADI SAIPDGRIEF RYHGPTIGIG PTIMLPVAAV YQVFGVGLTQ GRLVIVIYFA
IALVAGYALA RRLYDRQTAL IALALLLASR TVNYEGLIEY GRQVLGEAPG VAFVFLGMLA
WLTALKTAAQ PALRHAHRTW SILAGLGFGL ALVTKNQFVL IIPPALALTA LLDWRYYRAG
TWTLRLIPPI VAVGCFALWT VVQFALLGPG TFFENLQQTR QAAGGAIFVF NLRSTLRAGY
YLLRPDLFGG LVVPALAYTI WRARRRTSQG LNEALLALII GLWLAWFVGV SLGWPRYAFP
AVALSALTVA RLAFDTIVWL RRVLPAAATI AAIYLVVIIV LPMALTVRVV FTPDDSAQRF
AAYLNANVPE SAIIATWEPE LGVLTDHRYL YPPQPTLDQA VRHTWLGGDP VRYDWYADRP
EYVVVGSFGG YTGVYHTPEL ERHYIRVAQM GTYALYQVRA GSGE
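
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any length or composition check. The following is a minimal sketch, assuming plain whitespace-separated blocks; the helper names `clean_blocks` and `gc_fraction` are hypothetical and not part of the record. Only the first line of the gene sequence is used as sample input.

```python
def clean_blocks(text: str) -> str:
    """Join a blocked sequence by removing all spaces and newlines."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record (six blocks of ten bases).
first_line = "ATGATTCGTC CAGAAGCGCA CAGCGCTCCA CATCTTCAAA CAACGCCAAT GACCGTTGGC"

dna = clean_blocks(first_line)
print(len(dna))          # 60
print(gc_fraction(dna))  # GC fraction of this first line only
```

Pasting the full gene sequence into `clean_blocks` gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields.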