Gene Rcas_3567 details

Gene Information

Locus tag: Rcas_3567
Symbol:
ID: 5541068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4657124
End bp: 4659151
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 63%
IMG OID: 640895686
Product: glycoside hydrolase clan GH-D
Protein accession: YP_001433634
Protein GI: 156743505
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
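
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4657124
END_BP = 4659151
GENE_LENGTH_BP = 2028
PROTEIN_LENGTH_AA = 675


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```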


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.879137
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.341251
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCACT CTGGATCAGG CGCCACGCCT GAGACATGGA CGCTCTCGAC AGACACCGTA 
ACCATCACCG TGTCGCCGGC GGACGGGCGC CTGACCTGGA ATCTCAGCGG CGCAACTGGA
GGGCTTGACC TGTCATCGGT GACGGGGAGC GCTTTGCGTG TCGATGGACG GATGCCGCTC
TGGACGCAGA TTGTCGCTGT CGATGAGCAG TCCGTCGGAG ATGGCAGTCG CCTGTTGGCG
CTAACGCTCG ATACGGCGGA TGGCGGGCTG CGATTGATGC GCTGGTTTCA CCTGTTCGCC
GATCATCCGT TTGTGCGCAC CTGGGCGACG CTGGAGAATC GCAGCAGTGC GACGGTGCGG
ATCGATCAGT GCGATATTCT GACGCTGACG CCACACGCCC CGCCGCCACT CTATCTGTTC
CACGTCGAAC AGTTCAGTTG GAATTACCGG CGCGATTTCT TCAGTCAGCA CGAGGTCTGG
CTGCGGGTGG GATGCGCTCC GCACGAGATC CGCATGGGAT CGCACCCCGC CCACCACTGG
GGTCCCTCCA GTTGCGCGTG GTTCGCCCTG CGCGACGGTT CCCCCAACTG GAACGACGAG
CCGCCGGAGG GGGGGCGCGG CATGGTGTGC GGCATCGAGT TCAACGGCAA GAGCCGGTTG
CACGCCTGGG CAACCACCGA ACGGGTGAAT CTGGTGAGCC AGATCGATGA CCTGGCGCAC
CGCCTCGCAC CAGGCGCGAT CTTCGAGATT CCGGCGTTTT TCGTCGGGCG CTTCGAGGGG
GATTGGGACG AAGCCGGGTA TGTGACACAG CGTTTCGCCG AGGCGCATGT CCATCCCCCG
ATGCCCGACG ACCGCTACCC GTGGGTGCAG TACAATTCCT GGCGGTACGA GCAGAACATC
AACGAGGAGC AGCAACTGGC AGCCATCGAC CGCTGCGCGG AACTCGGCGT CGAACTTGTG
GTGATGGACC TGGGGTGGGC ACGTATGATC GGCGACTGGC GCCCCGACCC GATCAAGTTT
CCGCGCGGGC TGAAGCCGCT GGTCGAACGG GCGCACTCCT ACGGTATGCG GTTTGGCGTT
CATGTTGCGC TGGCGCAGTG CAACCCGGAA GCGCCGGTCG CCAGAGCGCA CCCCGACTGG
CTCATTCACA CCGGCAACGA TTACTATGGC GCCGGTCCGC TCTGCCTGGG ACACGAGCCG
TGCCGCCAAT GGCTCATCGA GCAGTTGATC CGGCTGGTGG ACGAAGAGGG GATCGACTAC
ATCATCCAGG ACGGCGAGGA TATGGTGAAG CGGTGCGAGC GCAGCGATCA CACGCATGCA
CCGGGCGACA GCAACTACGC CAACTCACAG TATGGGCTGG ACATCGTCAT CGAATCGCTG
CGCCGCGCCT GCCCACACCT GGTGCTCGAA AATTGCGAGG ACGGCGGATG CATGATGACC
TATAAAATGG CGCGTCTGTA CCATACCAGC ATCACGGTGG ACAATACTTC GTCCTATGCC
ACACGACAGG GGGTGTACGG CGCGTCTTAC CCGTTCTCGC CGCGCTATAG CGTGCGCTAC
ATGCAGGACG ATCCCTCGCC CTACACCCTG CGCAGTTCAA TTTTCGGCGG GCCTCTCATC
CTGATGCAGC GCGTCACCGA GTGGAACGAA GCGCAGATGG CGGAAACCAG GCGTGCTATC
GAACAATATA AAGCATTGCG CCATCTCATC CGTTCGGCGA AGATCATCCA TCTGAAAGCG
CCGCTGCACA ACATCGATGG TCTGGGATGG GGATGGGACG CTCTCCAGGC AGTGGCGCCC
GACCAATCCC GCAGCGTCGT GATGGTATAC CGCGCGCAGG GAGATGTCGC CGAGCGCACC
TTCAAGCCGC GCGGTCTGCT CCCAAACGCG GCGTATCTGG TCCGCTACGT CGATAGCGGG
CGCACCCTGC AACGCACCGG CGCCGAACTG GAGCGCGACG GGATCACCGT GGCGCTAGAA
GAGTTCAGTT CAGAGCTCGT TATGATAGAG GTGGAGGGTG AAAGGTAG
 
Protein sequence
MTHSGSGATP ETWTLSTDTV TITVSPADGR LTWNLSGATG GLDLSSVTGS ALRVDGRMPL 
WTQIVAVDEQ SVGDGSRLLA LTLDTADGGL RLMRWFHLFA DHPFVRTWAT LENRSSATVR
IDQCDILTLT PHAPPPLYLF HVEQFSWNYR RDFFSQHEVW LRVGCAPHEI RMGSHPAHHW
GPSSCAWFAL RDGSPNWNDE PPEGGRGMVC GIEFNGKSRL HAWATTERVN LVSQIDDLAH
RLAPGAIFEI PAFFVGRFEG DWDEAGYVTQ RFAEAHVHPP MPDDRYPWVQ YNSWRYEQNI
NEEQQLAAID RCAELGVELV VMDLGWARMI GDWRPDPIKF PRGLKPLVER AHSYGMRFGV
HVALAQCNPE APVARAHPDW LIHTGNDYYG AGPLCLGHEP CRQWLIEQLI RLVDEEGIDY
IIQDGEDMVK RCERSDHTHA PGDSNYANSQ YGLDIVIESL RRACPHLVLE NCEDGGCMMT
YKMARLYHTS ITVDNTSSYA TRQGVYGASY PFSPRYSVRY MQDDPSPYTL RSSIFGGPLI
LMQRVTEWNE AQMAETRRAI EQYKALRHLI RSAKIIHLKA PLHNIDGLGW GWDALQAVAP
DQSRSVVMVY RAQGDVAERT FKPRGLLPNA AYLVRYVDSG RTLQRTGAEL ERDGITVALE
EFSSELVMIE VEGER
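
The sequences above are printed in space-separated blocks of 10 characters. As a rough sketch, the gene sequence can be cleaned, its GC content recomputed, and its translation compared with the listed protein sequence. The helper names below are hypothetical. The amino-acid assignments are those shared by the standard genetic code and translation table 11; the alternative start codons that table 11 allows are not handled, which is an assumption that holds here only because the gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    return 100.0 * sum(1 for base in seq if base in "GC") / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    fragment = clean_sequence("ATGACCCACT CTGGATCAGG CGCCACGCCT")
    print(translate(fragment))  # MTHSGSGATP
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives a value that can be compared with the GC content field, and `translate` gives a string that can be compared with the protein sequence.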