Gene Rcas_2554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2554 
Symbol 
ID5540036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3294633 
End bp3296069 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content60% 
IMG OID640894683 
ProductAlpha-L-fucosidase 
Protein accessionYP_001432650 
Protein GI156742521 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0869924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.838653 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTACG AACCAACGCT CGAATCTGTC CGCGCGCACA TCGTTCCCGA CTGGTTTCAC 
GATGCGAAAC TGGGGATTTT CATTCACTGG GGTCTCTACT CCGTCCCCGG ATGGGCGCCG
ACGACAGGAC CGCTCCACGA TGTCGTGGCG AAAGAAGGCT GGAAAGAATG GTTCAGGCGC
AATCCATACG CCGAGTGGTA CATGAACTCA TTGCGCATTC CCGGCAGCCC GACCGTCGAT
TACCACGCGC AGCATTTTGG CGCCGATTTT CCATACGAAC GTTTCGCCGA GACGTTCAAC
CGTGAGACGG CAGCGTGGGA CCCGGCAGTC TGGGCGGATC TGTTCAAGCG CGCCGGAGCG
CAGTACGTTG TGCTGACCAC CAAGCACCAC GACGGTTTCT TGCTCTGGCC CAGCGCGCGC
CCCAATCCGT TCCGCGCAGG GTACCACGCC ACACGCGACC TGGTTGGCGA TCTGACCAGC
GCCGTGCGTG CCGCAGGGCT GCGCATGGGT CTCTATTACT CCGGCGGTCT CGATTGGACG
TTCAATGACC GGGTGATCGC CGATATTGTT GACCTCTTCA TGGGCGTGCC ACAGCAGTTG
GAGTACGTCG AGTACGCCAA TGCGCACTGG AAGGAATTGA TTGATCGCTA TCAGCCCAGC
ATACTCTGGA ATGACATCGG CTATCCGGCA GCAGCGAACC TGGTGGAGTT GTTTAGTTTC
TACTATAACA CTATTCCCGA AGGGGTCATC AACGACCGCT TCACGCAGGC GTCCATCGGC
GACGGTCCGC CTGATCCGGC AGCGATCATG CAGGCGCTGG CGCAGGGCAT GCTTCCCACA
CCGCCGCACT TCGACTTCCG CACCCCAGAG TATGCGGTGT TCCCCGACAT CAAAGCCGAA
AAGTGGGAGT CGTGCCGTGG TTTGGGATAC TCGTTCGGCT ACAACCGCAA CGAAACGGTT
GACGATATGC TTTCGCCCGT CAAGTTGATC CGATCATTCG TTGACATCGT CAGCAAGAAT
GGCAACCTGC TGATCAATGT CGGTCCGATG GGTGACGGAA CGATTCCGCC GGAGCAGGCG
GAGCGACTCG AAGCGCTCGG CGCGTGGCTC TCGGTCAACG GTGAAGCGAT CTACGGTACG
CGCCCCTGGA CGCGCGCTGA GGGCGTGACC GACAGCGGCA TCGGGATGCG ATTTACGCGC
AAGGGCGACG ATCTGTATGC GATCCTCCTT GATACGCCGC AGCAGCCCAA CGTCACCCTG
CTTGACGTGA CCGTTTCTCC AGGATCGCGT GTGATTCTGA CCGGTTACGG CGAGATTGCA
GCGGCACAGC ACGACCACGG TTTGACCATA ACCTTGCCTG CGTCGCTCGC CGATGCGCCT
GCGCACGCGA TCCGCATTGT TGGCGGTGCA TCGGCTGCGT CGGGAGGCGG AAGGTAA
 
Protein sequence
MMYEPTLESV RAHIVPDWFH DAKLGIFIHW GLYSVPGWAP TTGPLHDVVA KEGWKEWFRR 
NPYAEWYMNS LRIPGSPTVD YHAQHFGADF PYERFAETFN RETAAWDPAV WADLFKRAGA
QYVVLTTKHH DGFLLWPSAR PNPFRAGYHA TRDLVGDLTS AVRAAGLRMG LYYSGGLDWT
FNDRVIADIV DLFMGVPQQL EYVEYANAHW KELIDRYQPS ILWNDIGYPA AANLVELFSF
YYNTIPEGVI NDRFTQASIG DGPPDPAAIM QALAQGMLPT PPHFDFRTPE YAVFPDIKAE
KWESCRGLGY SFGYNRNETV DDMLSPVKLI RSFVDIVSKN GNLLINVGPM GDGTIPPEQA
ERLEALGAWL SVNGEAIYGT RPWTRAEGVT DSGIGMRFTR KGDDLYAILL DTPQQPNVTL
LDVTVSPGSR VILTGYGEIA AAQHDHGLTI TLPASLADAP AHAIRIVGGA SAASGGGR