Gene Rcas_3644 details


Gene Information

Locus tag: Rcas_3644
Symbol:
ID: 5541146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4767574
End bp: 4769010
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 64%
IMG OID: 640895764
Product: glycosyl transferase group 1
Protein accession: YP_001433711
Protein GI: 156743582
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
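
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4767574
end_bp = 4769010

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; assume the last codon is the stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1437
print(codon_count)         # 479
print(protein_length_aa)   # 478
```

Under these assumptions the result agrees with the listed gene length of 1437 bp and protein length of 478 aa.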


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0102527
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000773484
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCACATTC TGTTGATAAC GCCCTGGCTG ACGATTGGCG GCGCGGATCG AGTCCATCTT 
GACCTGATTC GTCAGTTGAA CCGGCGCGGC TGTCGGTTCA GCGTGGTGGC GACATTGCCG
GCGAAGCACG AATGGCGCCC ATTGTTCGAG GAACTGACGC CTGATGTCGT CACACTGCAT
CCGACCATTG CGCCAGCACA GCAACCGGCG TTCGCGCGCG ACCTGATCAG GTCGCGCGGC
ATTCATGCAG TTCTCATCGG CAACAGTCAG TTCGGATATG CGTTGCTGCC GTATCTGCGG
TCCTGTTGCC GGGATGTGGC GTTCCTTGAC ATACTGCACG CAGTAGAACC ACACTGGCGA
GACGGCGGGT ATCCGCGCCT GTCGCTCGAC CATGCCGCCT GGCTCGATCT GAGCATCACC
GTTTCACGCG ACTTGCGCGA CTGGATGATC GCGCGCGGCG GTGACCCGGC GCACATCGAG
GTCTGCTATG CCAACATCGA TGTTGACGCA TGGAATCCGG CGCTTTTTGA CCGCGCAGCG
TTGCGGCGAG CGTTCGGCAT CCCGCCGCGC GCGCCGCTGA TCCTGGTGAT CGGGCGATTG
TCGTCGGAAA AGCGTCCACG TCTGGCGGTG CGCATCCTGC GGGAAGTGGC GCGCCAGGGT
ATCGCCTTTC ATGCGCTGAT TATCGGCGAT GGACCGGAGC GCCCGGTGTT GGAGCGGATG
CTGCGCGACC CATTGCTCCA GAACGTCCGC TTGACCGGCG CGTTGCCCGA AGAGCGGGTG
CGGGAGGTTA TGGCAGCCGG GGACGTGCTG CTGCTACCCT CGGCGCGGGA GGGGATTGCG
CTGGTGTTGT ACGAAGCGAT GGCGATGGGG ATGGTTCCGG TGGTCGCCGA TGTCGGCGGG
CAGCGTGAAC TGGCAACACC CGACTGTGGC ATGCTTATTC CATCATCCAA GAGCGAAGAA
GCGGCATACG TTGCGGCACT CGCCGGTCTG TTGCGCGATC CGGCGCGTCG CGCGGCGATG
GGTGCTCAGG CGCGACGGCG GATCGTTGAC CATTTTCGGA TCGATCAAAT GGGTGATCGC
ATGGAGATGC TGTTCAGGCG CGCGGTTGAG CGCGCGATGG GCGCCGAGCG TCCAATTCCG
ACGGAAGGCG ACGCGGCGCG CAGCGCGGTC GAGGCGATCC GCCTTGCGCG TCACGCAAGG
GATGTGGCGC GTTTGTGGCA AGCCGACAGG TATTCTGAGG ATGCGTCCCT CTCACCTGTG
CGCCGCGCCG TGTTGGGTGT TGTGCGCAGT ATGCGGCAAC GGTTGCGCCC ATGGTACCGA
CGTCTGGTCG GTCGTGATGG ACATCCCTTC AGCCGTGGAG TTGTTGCAGT GCGTGATCGG
GTAGTGGCGT GGGTGTATGA CGAAGGGCGA GAGGCCCCCC CCGGTAGGCG GGTTTAA
 
Protein sequence
MHILLITPWL TIGGADRVHL DLIRQLNRRG CRFSVVATLP AKHEWRPLFE ELTPDVVTLH 
PTIAPAQQPA FARDLIRSRG IHAVLIGNSQ FGYALLPYLR SCCRDVAFLD ILHAVEPHWR
DGGYPRLSLD HAAWLDLSIT VSRDLRDWMI ARGGDPAHIE VCYANIDVDA WNPALFDRAA
LRRAFGIPPR APLILVIGRL SSEKRPRLAV RILREVARQG IAFHALIIGD GPERPVLERM
LRDPLLQNVR LTGALPEERV REVMAAGDVL LLPSAREGIA LVLYEAMAMG MVPVVADVGG
QRELATPDCG MLIPSSKSEE AAYVAALAGL LRDPARRAAM GAQARRRIVD HFRIDQMGDR
MEMLFRRAVE RAMGAERPIP TEGDAARSAV EAIRLARHAR DVARLWQADR YSEDASLSPV
RRAVLGVVRS MRQRLRPWYR RLVGRDGHPF SRGVVAVRDR VVAWVYDEGR EAPPGRRV
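
The gene and protein sequences above can be spot-checked against each other by translating a few codons. This is a small sketch, not the method used to produce this record: it assumes the amino acid assignments of the standard codon table, which translation table 11 shares, and it does not handle the alternative start codons that table 11 allows. The helper name `translate` and the constants are hypothetical.

```python
import itertools

# Standard codon table in TCAG order; translation table 11 uses the same
# amino acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGCACATTC TGTTGATAAC GCCCTGGCTG ACGATTGGCG GCGCGGATCG AGTCCATCTT"
# Last 15 bases of the gene sequence, ending in the TAA stop codon.
last_codons = "GGTAGGCGGGTTTAA"

print(translate(first_line))   # MHILLITPWLTIGGADRVHL
print(translate(last_codons))  # GRRV
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final four residues.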