Gene Rcas_2449 details


Gene Information

Locus tag: Rcas_2449
Symbol:
ID: 5539930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3146131
End bp: 3147492
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 59%
IMG OID: 640894579
Product: glycoside hydrolase family protein
Protein accession: YP_001432547
Protein GI: 156742418
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
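
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon that is not counted in the protein length; both are assumptions about how the record was built, not statements from the page. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3146131
END_BP = 3147492
GENE_LENGTH_BP = 1362
PROTEIN_LENGTH_AA = 453


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1362
print(coding_codons(GENE_LENGTH_BP))   # 453
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed in the record.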


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.234519
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCG CCATTAAGAT CAGCGTGATC GGCGCCGGCA GCGCCCAATT CTCGCTGGGG 
CTGGTCAAAG ATCTGTGTCT GACGCCAGGA CTCGCAGGCA GTCTGGTCAG TTTCATGGAT
GTCGATCTTG CCCGCCTTGA GATGATCGAG AAACTGGCGC GCCGCTATGC CGCCGAACTG
GGCTCCGATC TGCGCTTCGA GCGCACTGCG GATCGCGCCG CGTCGCTCAC CGACGCCGAT
TTTGTGATCA ATACCGCATC GGTCGTCAGC CACCATCATC AGCGCGCCAT GCGCGAGGTG
ACGGCGAAGC ACGGCTACTA TTACGGCGGC GTTGCTTTCG GCAATCACGC GCAACTGGCG
TTCATGCTCG CCGTGGCGCG CGATATGGAG CGCATCTGTC CAAATGCCTG GCTGATCCAG
TCGGGTAATC CGGTGTTTGA AGGATGCACG CTGATGACCC GTGAGACCGG CATTAAGGTG
TGTGGTCTGT GCCATGGTCA CTACGGTGTC TATCAGGTGG CTATGACCCT GGGATTAGAC
CCACAGAAGA TCACCTGGCA GGCGCCGGGG TTGAACCACA ATATCTGGCT GACCCATTTT
CTCTACGAAG GGAAAGACGC CTATCCGCTG CTCGACCGCT GGATCGCCGA ACAGAGCGAG
ACCTTCTGGC GCACCCATGT GGCGCAGAGC ACCCACGACA TTCAGATGTC GCGCGGCGCC
ATTCATATGT ACCGCATGTA TGGGTTGATG CCAATCGGCG ATACGCCGCG CCAGCAACGC
AACTGGTGGT ATCACACGAG CCTTGAGGTC AAGAAATACT GGTTCGGCGA ACCGTGGGGC
GGTCCCGACA CCGAGATTGC CCGTCCATTC TTCGTCGAGG GTCTGGAGAA GCGCATCGCG
TTCATGACTC AGCTGGCGAA CGATCCGAAA GCCAGCCTGG TCGAAACGTT CGGCAGCGAG
AAGACGCGCG AGCAGCAGGT TCCGATCATC GATGGGCTGG TCAACAACAA CGAGTATGTC
GCTCAGGTGA ACATTCCGAA CCACGGCGCG CTGCCCGGCG TTGCCGATGA TGTGGTGGTC
GAGGTTCCAG CAATCATCAA TGCCAAAGGT ATTCAACCGC TGCGCGTGCC ACCGTTGCCA
CGGAAGATTA TGCTCGAAAT GATTCTGCCC GAAGTGCTCG ATATGGAACG CGAACTCCTG
GCGTTCAAAA CCGGCGATCG CTCGATGTTG CTCTGGAGTG TGCTCAACAG TCCGCAGACC
CGCTCATATG AACAGGCGGT CGCCGTGCTC GATGATCTGC TGGCGATGCC CGGTCATGAG
GAACTGGCAA CGCATTTCCG GTGGCCGGAG ACATGGGAGT AA
 
Protein sequence
MSTAIKISVI GAGSAQFSLG LVKDLCLTPG LAGSLVSFMD VDLARLEMIE KLARRYAAEL 
GSDLRFERTA DRAASLTDAD FVINTASVVS HHHQRAMREV TAKHGYYYGG VAFGNHAQLA
FMLAVARDME RICPNAWLIQ SGNPVFEGCT LMTRETGIKV CGLCHGHYGV YQVAMTLGLD
PQKITWQAPG LNHNIWLTHF LYEGKDAYPL LDRWIAEQSE TFWRTHVAQS THDIQMSRGA
IHMYRMYGLM PIGDTPRQQR NWWYHTSLEV KKYWFGEPWG GPDTEIARPF FVEGLEKRIA
FMTQLANDPK ASLVETFGSE KTREQQVPII DGLVNNNEYV AQVNIPNHGA LPGVADDVVV
EVPAIINAKG IQPLRVPPLP RKIMLEMILP EVLDMERELL AFKTGDRSML LWSVLNSPQT
RSYEQAVAVL DDLLAMPGHE ELATHFRWPE TWE
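
The relationship between the two sequence listings and the Translation table and GC content fields can be sketched in a few lines. This is a minimal illustration, not the pipeline that produced the record: the codon-to-amino-acid string below is the standard assignment, which translation table 11 shares for internal codons (table 11 differs in which codons may act as starts, which does not matter here because the gene begins with ATG). The helper names clean, translate and gc_fraction are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTCCACCG CCATTAAGAT CAGCGTGATC"
print(translate(first_blocks))  # MSTAIKISVI
```

Passing the full gene sequence to translate should reproduce the protein listing, and gc_fraction on the same input gives the value that the GC content field reports as a rounded percentage.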