Gene Rcas_3105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3105 
Symbol 
ID5540601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4023107 
End bp4024372 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content61% 
IMG OID640895224 
Productglycosyl transferase family protein 
Protein accessionYP_001433177 
Protein GI156743048 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.359614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGC GATTCTTCTG GTTCTGCGTG ACGCTGATCG GGTATGTCTA TGCCGGCTAT 
CCGGCGCTGT TGACCGTACT GGCGCGGTTG CGTCCGCAAC CGTTGTTTGC GCCGCCTGCC
GATCTGCCGA TAGTGACGCT CCTGATCGCG GCATACAATG AGCAAAACGT GATTGCTGCC
AAACTGAGCA ATAGTCTGGC GCTCGATTAT CCGCGCGACA GGCTCCAGAT TCTGGTGGCT
GCCGATGGGT CGGATGATGC CACGCCCGAC ATTGTCGCCG ATTTTGCCGA TTGTGGCGTC
GAATTGAGTT ATCGCCCCGA GCGCGCCGGG AAACTGGCGG CGATCACCCG TGCGCTCGCG
CTGGCGCGTG GTGAGATTAT CGTGTTGTCC GATGCGAATA ACCTGTACGA CGCAGGCGCA
TTGCGGGCGC TGGTCGCGCC ATTTGCCGAT CCGAGCGTCG GAGCGACGAC AGGCGCCAAA
GTGATTGCGA AGGGCGACGG AGCGCTTGGT GACTCGGAAG GGTTGTACTG GAAGTACGAG
TCGTACATCA AGCGCCAGGA GACGCGACTG AGCAGTTGCA CCGGCGCAGT TGGCGAAATT
ATGGCGGTGC GACGCGGGTT GCTCGATCAG CCGTTGCTGC CGGAGGCGCG GTTGATGGCA
GACGATCTGG CGCTCGCCAT GCATGTGCTG AAACAGGGGT ATCGCGTGGT ATACATACCC
AACGCGCGCT CAATCGAGCG GGTATCTGCT TCGGCGCAGG ACGAGCAGGA GCGTCGGGCG
CGAATTGTGG CGCAGCGTTT TGTGCTGATG CGGCACTCGC ACAGGATGTT GCCGCTGTTG
AATCCGCTGC TCGTCTGGCA GATTGTGTCG CATAAGTACC TGCGCCCGTT TGTGCCGCTG
GCGATGATCG GCGCGCTGCT TGCCAATCTG GCGGCGGTGA TTCGTCCGGC GGCGCAGGGG
GGGATGCTGC GGCTGGCGTC CCCCTTCAAC TGGGTGATGC TGGCGTTGCA GGCAGTGTTC
TATGCGCTGG CATGGATGGG AGGGCGCAAC GAATGTCGCG GCATATGGGG AAAAGCGCTG
TATATTCCGG CGTTCCTGGT GAATGGCAAT CGCGCGGCGC TCGTGGGACT GTACCGTTTT
CTGACCGGGC GCCATACCTC GCTTTGGAAT CGTGTTCAGC GGCGTGAACG TGAAAGCAGC
GCATCTGAGC AGCGCCGTGT CAACCCGTCG TACCGTGTAC TATCGGGAAA GGAAAACAAT
CCATGA
 
Protein sequence
MSGRFFWFCV TLIGYVYAGY PALLTVLARL RPQPLFAPPA DLPIVTLLIA AYNEQNVIAA 
KLSNSLALDY PRDRLQILVA ADGSDDATPD IVADFADCGV ELSYRPERAG KLAAITRALA
LARGEIIVLS DANNLYDAGA LRALVAPFAD PSVGATTGAK VIAKGDGALG DSEGLYWKYE
SYIKRQETRL SSCTGAVGEI MAVRRGLLDQ PLLPEARLMA DDLALAMHVL KQGYRVVYIP
NARSIERVSA SAQDEQERRA RIVAQRFVLM RHSHRMLPLL NPLLVWQIVS HKYLRPFVPL
AMIGALLANL AAVIRPAAQG GMLRLASPFN WVMLALQAVF YALAWMGGRN ECRGIWGKAL
YIPAFLVNGN RAALVGLYRF LTGRHTSLWN RVQRRERESS ASEQRRVNPS YRVLSGKENN
P