Gene Rcas_3882 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3882 
Symbol 
ID5541388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5077209 
End bp5078519 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content59% 
IMG OID640895993 
Producthypothetical protein 
Protein accessionYP_001433936 
Protein GI156743807 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3693] Beta-1,4-xylanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0281101 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGAAAG CGTCATCGGC TATGCATCGT TCCAATATGT GCGTGTTGTC CTGGATTGCG 
CGATATGGCT TTATCATCGC ACTCCTCGCC GCCTGTGGAT CGGCGACGAC CAACCCGCTT
ACCCGCCCTG ATATTGATGA AGTGCGCCCC ACCACCGCAG CGCAGCCGAC GTTGACCACA
GCGCCGACAC CGACGAGAGC GCCGGTCTCT GGAACGCCGC AACCGCGCTT TCTGCGCACA
TCTGCGCTGC AATTCGGCGT AGTGGCGCAT CTGTACTATA CCGACCGCAG CCGGGTGCTG
ATGCTGACGA AGATCGCTGG CTTCGACTGG GTGCGACAAC AGGTACACTG GAAAGATATC
GAAGCGGCGC CGGGCGTCTA TTACTGGGAT GAACTGGACC ACATCGTCGC CGATGTGTCT
GCCAGCAACC TGAAATTGCT GGTGAACATC GTGCAATCGC CGCCATTCTA CCCTCCCGGC
AATGGCGGCA AGCCGCGCGA CCCGAAGGTT ATGGGCAATT TTGTGGCAGC AATGGTGGAA
CGATATGGAG ATCGCATTGC GGCTATCGAG ATCTGGAACG AGCCAAACCT GGCGGTCGAG
AATGGCGGGC GCGTTACGCC TGAGGATCCG GGACGGTACG TTGAGATTCT GGCAGAGTGC
TATCGCCGCA TCAAGGCGAT CAACCCCAAT ATCTACGTGC TGGCGGCAGC GCCGGCATCG
ACCGGCGTGT TCGACCCGGA GCGCGCCATT CCCGATATTG AGTACCTGCG CGCCATGTAC
ACCTACAAGA ACGGCATGAT CCGTGACTAT TTCGATGCGC AGGCAGCGCA CCCCGGCGGC
GCAGCGAACT CCCCCGACTG GCTCTATCCC GAATCTCCCG GCAATCGTCC GGCGTGGAAC
GACCATCCGA CCCATTACTT CCGCCACGTC GAAAATGTGC GCGCGTTGAT GATCGAACAC
GGTCTCGGCG ACCGGCAGAT CTGGATTACC GAATACGGAT GGGCGACGCC AAATACGACA
CCGGGATTCG AGTTTGGCAA CTTGATGACG TTCGACGATC AGGCAGACTA TATCGTTCGG
GCGATCACGC GAGTCTATGA GCAGTATCGT GACGAAGAGG GGCGTCCGTG GGTCGGGGCG
ATGTTTTTAT GGAATATGAA CTTTGCCGTC CTGTGGGGTG CACAGGGCAA CCCGAACCAC
GAGCAGGCAT CGTTTAGTCT GCTGAATCCC GACTGGAGTC CTCGTCCGGC ATTTATTGCA
CTTCAGGGGT TGCATCAGCG TCTGAAAGTA TCTCAGGGGC GGTCGCCATA A
 
Protein sequence
MRKASSAMHR SNMCVLSWIA RYGFIIALLA ACGSATTNPL TRPDIDEVRP TTAAQPTLTT 
APTPTRAPVS GTPQPRFLRT SALQFGVVAH LYYTDRSRVL MLTKIAGFDW VRQQVHWKDI
EAAPGVYYWD ELDHIVADVS ASNLKLLVNI VQSPPFYPPG NGGKPRDPKV MGNFVAAMVE
RYGDRIAAIE IWNEPNLAVE NGGRVTPEDP GRYVEILAEC YRRIKAINPN IYVLAAAPAS
TGVFDPERAI PDIEYLRAMY TYKNGMIRDY FDAQAAHPGG AANSPDWLYP ESPGNRPAWN
DHPTHYFRHV ENVRALMIEH GLGDRQIWIT EYGWATPNTT PGFEFGNLMT FDDQADYIVR
AITRVYEQYR DEEGRPWVGA MFLWNMNFAV LWGAQGNPNH EQASFSLLNP DWSPRPAFIA
LQGLHQRLKV SQGRSP