Gene Rcas_1884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1884 
Symbol 
ID5539362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2418834 
End bp2420240 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content59% 
IMG OID640894021 
Productglycoside hydrolase family protein 
Protein accessionYP_001431992 
Protein GI156741863 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0104687 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCAA AGATTGTCTT CATCGGCGCC GGCAGCACGG TGTTCGCCAA AAACTTGATG 
GGCGACATCC TGAGCTTCCC CGAACTTGCC AACGCCACGC TGACGTTGTT CGACATTGAC
CCGGAGCGCT TGCGTACATC CGAAGTCGTG GCGCACAAGG TCGCTGCGGC GCTCGACGCG
CGCCCGACGA TTGAGGCGAC GACCGACCGA CGCCGCGCGC TCGATGGCGC CGATTATGCC
ATTTGCATGA TCCAGGTCGG CGGTTATAAG CCGTGTACGG TGACCGATTT CGAGATCCCG
AAGAAGTATG GTCTGCGCCA GACGATTGCC GATACCCTGG GCATCGGCGG GATTATGCGC
GGGTTGCGCA CCATTCCAGT GTTGCTCTCG ATATGCCGCG ATATGGAAGA GGTGTGTCCC
GATGTGACGT TCTTGCAGTA TGTCAATCCA ATGGCCATGA ATTGCTGGGC GATCAGTCGC
GCCAGCACGA TTAAGACGGT CGGGTTGTGC CACAGTGTGC AGGGCACTGC CGAGCAACTG
GCGCACGACA TTGGCGTGCC GGTAGAGGAG ATCAACTATG TCTGCGCTGG CATCAACCAT
ATGGCGTTCT ACCTGCGCTT CGAGCGAAAC GGCGAAGACC TCTATCCGCT CATCCGCAAG
GTCTACGACG AAGGGCGCGT GCCGGCGTGG AATCGGGTGC GCTACGAAGT GTTCCGACGC
CTTGGCTATT TTGTGACCGA GTCGAGCGAG CACTTCAGTG AATACGTGCC CTGGTTCATC
AAGCGCGACC GACCCGATCT GATCGAGCGC TTTAATATTC CGCTCGATGA ATACATCCGC
CGTTGCGAAG TGCAACTGGC GGGCTGGGAG TTCGTGCAGT ACAAACTCGA TCATCCAGAG
ATTGCGAGTT CCGATCTGCG CGCTGCGATG CGCGAACGTT CCCGCACCAA TCCGTACCTG
ACGGGCGAGT TCATCGACTA TGTCATCACA TCTGCCGAAG ACCTCGAGGC AGGGAAGGTC
GAGCGCAGCC ACGAGTATGG CTCGCTGATC ATCCACAGCC TGGAAACGGG GCAACCGCGC
GTGGTCTACG GAAATGTGCC TAACTATGGG CTGATCGATA ATCTCCCACA GGGGTGCTGC
GTCGAAGTTC CGTGTCTGGT GGACAAAAAC GGCGTTCAGC CGACAAAGAT CGGCGCACTG
CCGCCGCACC TCGCTGCGCT CATGCAAACG AACATCAACG TGCAGGCGTT GACCGTCGAA
GCGGCGCTGA CCGGCAAACG CGAGCATATC TACCACGCGG CGATGCTCGA TCCGCACACT
GCCGCTGAAC TCGATCTGGA TCAGATCTGG GCATTGGTGG ATGACTTGAT CGCCGCCCAC
GGCGACTGGT TGCCGGAGTA TCGGTAG
 
Protein sequence
MPPKIVFIGA GSTVFAKNLM GDILSFPELA NATLTLFDID PERLRTSEVV AHKVAAALDA 
RPTIEATTDR RRALDGADYA ICMIQVGGYK PCTVTDFEIP KKYGLRQTIA DTLGIGGIMR
GLRTIPVLLS ICRDMEEVCP DVTFLQYVNP MAMNCWAISR ASTIKTVGLC HSVQGTAEQL
AHDIGVPVEE INYVCAGINH MAFYLRFERN GEDLYPLIRK VYDEGRVPAW NRVRYEVFRR
LGYFVTESSE HFSEYVPWFI KRDRPDLIER FNIPLDEYIR RCEVQLAGWE FVQYKLDHPE
IASSDLRAAM RERSRTNPYL TGEFIDYVIT SAEDLEAGKV ERSHEYGSLI IHSLETGQPR
VVYGNVPNYG LIDNLPQGCC VEVPCLVDKN GVQPTKIGAL PPHLAALMQT NINVQALTVE
AALTGKREHI YHAAMLDPHT AAELDLDQIW ALVDDLIAAH GDWLPEYR