Gene Rcas_3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3940 
Symbol 
ID5541446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5146292 
End bp5147500 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content65% 
IMG OID640896048 
Productglycosyl transferase group 1 
Protein accessionYP_001433991 
Protein GI156743862 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG CATTCCTGTG CACATCCGGT CTCGATTATC CATCGCCGCG CGGGCGCTGG 
CTGCCGCTGG CGCGCCGCCT GGCGCGCGAG GGGTACGAAC CGCATCTGCT GATGCTCCAC
CCGACTTTCG ACCGGCTGAA GGTACGACAG TTCGCCCATG ACGGGGTGCA TTGCGCCTAT
GTCGGGCAGA TGCACGTGTA CGGGCTGCCC GGCGAGCGAC GGCACTTCGG CGCGCTGGAA
CTGGCGTCGG TTTCACTGCA AGGCGCCCTG GCACTGGCGC TGGCAACCGT CCGCCTGCGC
CCCGATGCCA TCCATGTGGC GAAGCCGCAG CCGATCAATG GGCTGGCTGG CATCCTGGCG
GCACGCAACG GCACGGCGCT GTATGTCGAT TGCGACGATT ACGAGGCGGA GGCAAATCGT
TTTGGCGGCG CCTGGCAACG GCGGGTTGTG GCATGGTGGG AGGATCGCCT GCCACAGATG
GCGCGTGGAG TGAGCGTCAA TACGCACTTC CTCTATGATC GTCTGCGATG CCTGGGGGCG
CCCGAACAAC GCTTGCGCTA CGTTCCAAAT GGCATCGATC TGGAACGGCA GACGCCGCCG
GACGCGCGTC AGGTCGCGGC ACTACGCACG GCGCTTGGTC TAACCCATCA TCCGACGGTG
GTCTATCTCG GCGCAATCAG CGCCGTGGCG CATGGGGTGC GTCTGCTCAT TGATGCGTTT
GCGATGTTGG GGAAACATCT CCCCACGGCA CGCCTCGTGA TCATTGGCGA CGGCGATGAT
CGTCCGGCGC TGATGGCATA TGCCCGGGCG CGTGGTCTGG AGCGGACGAT CATCTGGGCA
GGGCGCATTC CACCTGAAAC TGCGCTCACA TGGCTGGCAG TCGGCGATTG TTCGGTCGAT
CCGGTGGAAG CGACGCCAGC CGCCGCTGCG CGATCGCCGC TCAAGATTGT CGAAAGTATG
GCGGTAGGGG TGCCGGTCGT GACCGGCGAC GTTGGCGACC GGCGTGAGAT GCTCGGCGAC
ACTGCCGGGC TGATCGTTTC TCCCGGCGAT GCGCGCGCGC TGGCGGATGG CATAACGACC
TTGTTGACCG ATCCGACGTA TCGCGCGCAA CTGGCGCAGG GGGCGCGTCT GCGAGCGGAG
GCTTACAATT GGAACCGGCT GGCATGCGTC TGGCAGACGC TCTATCAGAT CGGCGCATCA
TCGCTGTGA
 
Protein sequence
MRIAFLCTSG LDYPSPRGRW LPLARRLARE GYEPHLLMLH PTFDRLKVRQ FAHDGVHCAY 
VGQMHVYGLP GERRHFGALE LASVSLQGAL ALALATVRLR PDAIHVAKPQ PINGLAGILA
ARNGTALYVD CDDYEAEANR FGGAWQRRVV AWWEDRLPQM ARGVSVNTHF LYDRLRCLGA
PEQRLRYVPN GIDLERQTPP DARQVAALRT ALGLTHHPTV VYLGAISAVA HGVRLLIDAF
AMLGKHLPTA RLVIIGDGDD RPALMAYARA RGLERTIIWA GRIPPETALT WLAVGDCSVD
PVEATPAAAA RSPLKIVESM AVGVPVVTGD VGDRREMLGD TAGLIVSPGD ARALADGITT
LLTDPTYRAQ LAQGARLRAE AYNWNRLACV WQTLYQIGAS SL