Gene Rcas_0791 details


Gene Information

Locus tag: Rcas_0791
Symbol:
ID: 5538257
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1034531
End bp: 1035559
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 59%
IMG OID: 640892943
Product: glycosyl transferase family protein
Protein accession: YP_001430926
Protein GI: 156740797
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
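
The start, end, and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(1034531, 1035559)
print(length_bp, protein_length(length_bp))  # 1029 342
```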


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.965247
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAAGGGA CCGAACTGAC AAAAGACGAT GACGGACTGC GACCACTCCT GTCAACCGTC 
ATTGTCAACA GTGACGGTTG CGCCGACACA CTGCGCTGCA TGGAGTCGCT CTGTGCGCAT
CCGCCGGACC TTGCGGCGCT GGCGCCGCGC GCTGGCGCGC TGCCGGAGCA CGAGATCATT
CTGGTAGACA ATCGGTCACG CGATGGATGC GTGACGCTGG TGCGCGAGCG TTTTCCCGCA
GTCCGCATTC TTGAGTCACC GGCGCGCCAG GGATTTTCAA AAAACTACAA TCTTGGGATT
CGACACGCGA ACGGTGTATT TGTTCTGGCG CTCAATAACG ACACCCTCGT TCATCCTGGC
GCCCTGACGA CGCTGTTGAA CGCCATCCTT GACACTCCTG CGTATGGGAT GGTTGGTCCG
CGCCTGATTG GGCGCGATGG CTGGGTGCAG GCGGTATGCG CACGCCCGGT GCTGACGCCT
CTGGAATACC TCCTGACGCA ATCAATTGCC GATCCTGGCT TCCCTATCGG ACGGTTATGG
CTGCTCTCCC AGCAGATAAG CGTGGCGCGT CGGCGGAGCG GTTCTGTCCC TTGCATCAGC
GGCGCGTGCA TGCTGGTGCG ACGCACGGCG TTTGAACAGG CAGGATTGTT CGATGAAGCG
GTCGATTTCT ACTACGAAGA TATCGAATGG TGTCACCGTA TGGCATGCCA TGGCTGGCAG
GTAGGGTATG TCGCGGAAGC GACCATTACC CATCTCGGCG ATCAATCGAT CAGAAATGTC
AAAGTCTGGG CAAAGAAGAG CGAATACTTC AGCGCGCTGC GCTACTTTCG CAGGTACCAC
GGACTCACCG ATGAAGGGGC GCGTGTGCTG CGCGCAGCGA CCTCGTTTGG CTGGTTGATG
CGCGGCATTG CTTTCGTGCT GGCTGAAGGG TTGTTCGGGC TGAAAGGGCA CGCGCGCGCC
TATCTCTATC TCTGGCACTG GATTCTGCGC GATCCGTCCG CCAGACCTGA CCGGGTAGAG
AGTCAATGA
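
A GC content figure such as the one listed under Gene Information can be derived from a sequence laid out in 10-base blocks like the one above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the total number of bases; `gc_percent` is a hypothetical helper name, and how the record's own figure was rounded is not stated in the source.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input in the same blocked layout as the record.
print(gc_percent("GGCC AATT"))  # 50.0
```

Pasting the full gene sequence into `gc_percent` gives a value to compare against the listed GC content.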
 
Protein sequence
MQGTELTKDD DGLRPLLSTV IVNSDGCADT LRCMESLCAH PPDLAALAPR AGALPEHEII 
LVDNRSRDGC VTLVRERFPA VRILESPARQ GFSKNYNLGI RHANGVFVLA LNNDTLVHPG
ALTTLLNAIL DTPAYGMVGP RLIGRDGWVQ AVCARPVLTP LEYLLTQSIA DPGFPIGRLW
LLSQQISVAR RRSGSVPCIS GACMLVRRTA FEQAGLFDEA VDFYYEDIEW CHRMACHGWQ
VGYVAEATIT HLGDQSIRNV KVWAKKSEYF SALRYFRRYH GLTDEGARVL RAATSFGWLM
RGIAFVLAEG LFGLKGHARA YLYLWHWILR DPSARPDRVE SQ
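
The protein sequence begins with M although the gene sequence begins with GTG rather than ATG. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons, and an initiator codon is read as methionine. The sketch below illustrates this on the first line of the gene sequence. The codon table is built from the standard NCBI layout; `translate` is an illustrative helper, not a function of any particular library.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE.get(codon, "X")  # X marks an unrecognised codon
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGCAAGGGA CCGAACTGAC AAAAGACGAT GACGGACTGC GACCACTCCT GTCAACCGTC"
print(translate(first_line))  # MQGTELTKDDDGLRPLLSTV
```

The output matches the first 20 residues of the protein sequence above.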