Gene Rcas_0598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0598 
Symbol 
ID5538061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp796230 
End bp797735 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content59% 
IMG OID640892759 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_001430745 
Protein GI156740616 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.832453 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACCA GACCACCTGC ACGCATCTCC ATCGATGAAG AGCGCGTCAT TGGTCGCATT 
TCGCCGCTCC TGTTCGGCGG ATTCATCGAA CATATGGGGC GCTGCGTCTA TCAGGGCGTG
TTCGATCCCG GATCGCCGCT GGCTGATGAC CAGGGGTTCC GCACTGATGT GCTGGCAGCG
CTGCGCGAAC TGAACCTGCG GATCATTCGC TACCCTGGCG GCAATTTTCT GTCTGGCTAT
CACTGGCGTG ATGGTGTGGG ACCGGTCGCA CAGCGTCCGC GTCGCCGCGA ACTGGCGTGG
CAGTCGATTG AAACGAACCG CTTCGGCACA CACGAGTTCA TTGCGTTGTG TCGTATGCTT
GGCGCTGAAC CAATGCTCGG CGTCAACCTG GGGACCGGTA CGATCGAAGA GGCTGGCGCA
TATGTCGAGT ACTGCAACGC TCCAACCGGC ACAATCGAAG CCGATCGGCG AGTTGCGAAC
GGCGCACCGG AGCCGTTTGG CGTGCGCTAC TGGTGTCTTG GCAATGAGAT GGACGGACCC
TGGCAGATCG GTCATATGGA CGCGCACGCT TACGCTGTCA AAGCCCGCGA AGCCGCAAAA
CTGATGAAGT GGCACGATCC GTCGATCCGC CTGACGCTCT GCGGTTCATC GAGCAGCGGT
ATGCCGACCT ATCCCGAATG GGACCGAATT GCGCTCGAAG TGTGCTGGGA GTATGTCGAT
TATCTGTCGC TCCACTTCTA CGCGGGCAAC CGCGATGACG ATACTGACAG TTATCTGGCG
CTGGCGCGCC AGTTCGAGGA GCATCTCGAC GCTCTCGCCG GGACATTGCG CTATGTGAAG
GCAAAGATGC GATCACGTCA TAGTGTCTAT CTGAGCTGGG ATGAGTGGAA TGTGTGGTAC
AAAGACCAGA CAACGCAAGG GGGATGGCGC GAAGCGCCAC ACCTGATCGA GGAGGTGTAC
AACCTGGAAG ACGCACTGGT CGTAGCGCAG TGGCTGAATG TGTTCCTGCG CCGCTGCGAT
GTGCTGAAGA TCGCCTGCCT GGCGCAACTG GTCAATGTTA TCGCGCCCAT TCTGACCCGT
TCTGATGGGT TGATCCGTCA GTCGATCTTC TATCCGTTCG CGCTTTTCAG CCGGTATGCA
ACCGGCGACT CGCTCGACCT GCTCGTCCGG TCGCCGCTAT ATGCCACTCG CGCCTTCGGC
GATCAGCCCC TGATCGACGC AGCAGCCAGC TACGATGCTG AACATGGCAA GGGCGCCATT
TTTGTGGTTC ATCGCGGACA ACATGCGCCG CTAACGGTGA ATCTGGAGTG GCAGGGGCGT
TCGCCACGCC AGATCACGGA GATCTATCAG GTTGCCGGTG ATGATCCAAA AGCCGTCAAT
TCCTTCGAGC GACCCGATGT TATTGGCGTG CGCGCCCTGC CCGGCGCTCC GATCACCGAC
AGGCGGTTCA GCCTGAATCT CCCTCCACTC TCATTGACGG TAGCGCTGGT CGAATGGCCG
ACCTGA
 
Protein sequence
MTTRPPARIS IDEERVIGRI SPLLFGGFIE HMGRCVYQGV FDPGSPLADD QGFRTDVLAA 
LRELNLRIIR YPGGNFLSGY HWRDGVGPVA QRPRRRELAW QSIETNRFGT HEFIALCRML
GAEPMLGVNL GTGTIEEAGA YVEYCNAPTG TIEADRRVAN GAPEPFGVRY WCLGNEMDGP
WQIGHMDAHA YAVKAREAAK LMKWHDPSIR LTLCGSSSSG MPTYPEWDRI ALEVCWEYVD
YLSLHFYAGN RDDDTDSYLA LARQFEEHLD ALAGTLRYVK AKMRSRHSVY LSWDEWNVWY
KDQTTQGGWR EAPHLIEEVY NLEDALVVAQ WLNVFLRRCD VLKIACLAQL VNVIAPILTR
SDGLIRQSIF YPFALFSRYA TGDSLDLLVR SPLYATRAFG DQPLIDAAAS YDAEHGKGAI
FVVHRGQHAP LTVNLEWQGR SPRQITEIYQ VAGDDPKAVN SFERPDVIGV RALPGAPITD
RRFSLNLPPL SLTVALVEWP T