Gene Rcas_3938 details


Gene Information

Locus tag: Rcas_3938
Symbol:
ID: 5541444
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5138222
End bp: 5140036
Gene Length: 1815 bp
Protein Length: 604 aa
Translation table: 11
GC content: 63%
IMG OID: 640896046
Product: hypothetical protein
Protein accession: YP_001433989
Protein GI: 156743860
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3664] Beta-xylosidase
TIGRFAM ID:
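
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5138222
END_BP = 5140036


def gene_length(start, end):
    """Length of a feature spanning start..end, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Number of amino acids, assuming a complete CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the results agree with the listed Gene Length (1815 bp) and Protein Length (604 aa).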


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTCGA CATTGCCGTT CCGCATTGAC CGCATTCTTT CCATTGTGGT GACGACAGCA 
CTGCTGCTGG CGATGTTGCC GGCAGTTGCG CCGCCTGAAA CCTCGCTCGT TGTCCAGGAA
ACGTCCCGGG CAACGCTGCG TGATTCGCCC TTCGGCGTTA ACAGCCATCT GGCAACACGC
TACTGGGACC CTGCATCAAT GCATGTGCCG GCCGACGTCG TCGCGCGCCT GGGGGTCGGT
TGGGCGCGTG AGGATTTCCA CTGGTTTCGA ATCCAGCCAA CGCCCGATGC CCCTTACGAC
TGGACATTCA CCGATGAAGC AGTGCGCGCG CTCAATCGAC GTGGAGTGAA CATTCTGGGA
GTCATCGGGC ATCCGCCGGG ATGGGCAACC CCGTTCCCCG GCGACATTCC GCACGGCGTG
TCGTTCTACG CTCCCGATCC GCAACGGTTT GCCGCCTTTG CCGCCGCCGT TGCGCAGCGC
TACCGCAACT ATATCTCCCA TTGGGAAATC TGGAATGAGC CGGACAATCC GTTGTTCTGG
AAGCCTGCCC CTGATCCTGT CGCCTATGCC ACACTCCTGC GCCTCACTTC CGCCGCGATC
CGATCGGTCC ATCCAGAGGC GACCATTCTG ATCGGCGGGG TCTATTCGTT CGAGCCATCG
TTTCTGCGGC AGGTCGCCGA AGCAGGCGCA TGGCAGAGTT TCGACATTCT TGCCATCCAT
CCGTATGTCA GCCCAAGCGC GCCAGAGATC GGCAACCTGG TCGCCGGCGT CGAAGCGGCG
CGTTCCGTCG CCGAGCAGTA CGGCGCGCGC CCGATCTGGG TCACTGAGAT TGGGTGGTCG
AGCGGGCGCG GCGACCGCGA CCCCGTTGGA CTGGTGAATG AGCAGGATCA GGCCAATTTT
CTGGTGCGCT CAATGTTATT GCTCTGGAGC GCCGGTGTTG AGAAGGTTTT CTGGTATACG
CTCAAGGACG ACCCCGGCAA CCCCTATGGT CTGGTAGGGA CTGGCGTTGG TTATTTCGAC
TATAGCCGCC TGAAGCCGTC GTTTACCGCA TATCGGGTGA TGGTGGAGAA TCTAGCGGGC
GCCGACCTGG TGGTCGTGCG TGATCTGTTC AACCGAACTA CCGTGCTCGA CTTCGAGCAG
TTTGGTTCAT GGCGGCGCGG CGATCAGGCG TATGGCGATC TGACGCCGAC CGGTGAGCGC
GCGCGCAGCG GGCGCGGCGC GGCACAGTTG CGCTACACAT TCGCTTCGCG CACCAATGAG
TTCATCGTCT TTCGGCGCGA ACGCCCGGCG CCCATTCCCG ACGGCGCTTA CGCGCTGGGG
CTGTGGGTGT ACGGCGACGG TTCAGGAAAC ACGCTCAAAG TCTGGGTGCG CGATGCAGAA
GGGGAAGTGT TGCAGTTTAC CCTGGGTGCA ATCGGACCGC CGGGATGGCG CATTCTGGAG
GCGCCAATCG TGGGCGTTGC GCCGGAATGG GACCGGATCA GCGGCAATGG CAACGGGCGG
GTCGACTTTC CGGCTCGCCT CGACGCGATT GTGCTCGACG ATGCACCGGA TGACCTTGCC
AGCGGAGGGA CCATCTATCT CGACGACCTG TTCGCCGTCA GCGGACCGGA AGCGTATGAC
GCACAGTTTC AGCGCGGCGA TACAACCATC GACGTGTTGT GGGCGCCGGC GCCGGTACGC
GCCAGCATCC GCACCGGCGC ATCGACCGCC GCCCTGATCA CACGCGATGG CGGCTCATCG
ACGATTATCG CCAGCGAGGG TCGCCTGGTC ATCGACCTCG GACCGGCGCC GGTGTACGTG
GTTCATCGGC GATGA
 
Protein sequence
MRSTLPFRID RILSIVVTTA LLLAMLPAVA PPETSLVVQE TSRATLRDSP FGVNSHLATR 
YWDPASMHVP ADVVARLGVG WAREDFHWFR IQPTPDAPYD WTFTDEAVRA LNRRGVNILG
VIGHPPGWAT PFPGDIPHGV SFYAPDPQRF AAFAAAVAQR YRNYISHWEI WNEPDNPLFW
KPAPDPVAYA TLLRLTSAAI RSVHPEATIL IGGVYSFEPS FLRQVAEAGA WQSFDILAIH
PYVSPSAPEI GNLVAGVEAA RSVAEQYGAR PIWVTEIGWS SGRGDRDPVG LVNEQDQANF
LVRSMLLLWS AGVEKVFWYT LKDDPGNPYG LVGTGVGYFD YSRLKPSFTA YRVMVENLAG
ADLVVVRDLF NRTTVLDFEQ FGSWRRGDQA YGDLTPTGER ARSGRGAAQL RYTFASRTNE
FIVFRRERPA PIPDGAYALG LWVYGDGSGN TLKVWVRDAE GEVLQFTLGA IGPPGWRILE
APIVGVAPEW DRISGNGNGR VDFPARLDAI VLDDAPDDLA SGGTIYLDDL FAVSGPEAYD
AQFQRGDTTI DVLWAPAPVR ASIRTGASTA ALITRDGGSS TIIASEGRLV IDLGPAPVYV
VHRR
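
The gene and protein sequences above can be spot-checked against each other by translating the first and last codons. This is a minimal sketch, not a full translation tool: it builds the codon table from the NCBI ordering (bases T, C, A, G), relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons. The `translate` helper is a hypothetical name introduced for illustration.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (first, second, third base each cycling T, C, A, G).
# Table 11 shares these assignments with the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; spaces and newlines are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 15 bases of the gene sequence listed above.
first_30 = "ATGCGCTCGA CATTGCCGTT CCGCATTGAC"
last_15 = "GTTCATCGGC GATGA"

print(translate(first_30))
print(translate(last_15))
```

The first fragment should reproduce the opening residues of the protein sequence (MRSTLPFRID), and the last fragment should give its final four residues followed by a stop. Passing the entire gene sequence to the same function would allow a full comparison.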