Gene Rcas_3646 details

Gene Information

Locus tag: Rcas_3646
Symbol:
ID: 5541148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4770837
End bp: 4772360
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 60%
IMG OID: 640895766
Product: glycosyl transferase family protein
Protein accession: YP_001433713
Protein GI: 156743584
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
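
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length, and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS, minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4770837, 4772360)
print(length)                              # 1524
print(expected_protein_length_aa(length))  # 507
```

Both values agree with the Gene Length and Protein Length fields of this record.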


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00027326
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCTCAC CGCTCGTTTC GATCATCATC CGGGCGCGGA ATGAAGCGCC TGCGCTGCGA 
CGCCTCTTGC CTCTGCTCCA ATCGCAGGAG GTTGATTTTC CGTTCGACAT CTGGCTGCTG
GATAACGACT CGCAGGATGA GAGCGCAACT CTGGCGCGCG CGTATGGTCT CCGGTATCAC
CATATTCCGC GCGGCGGGTT CAACTATGCG GCGGCGCTCA ACCTGGGCGC ATCGCTGGCG
GAAGGCGAGT TTATCGTCAG TCTGTCGGCG CACTGTTTTC CGCAACGCAG TGACTGGCTG
GCGGCGCTCG TGGCGCCGTT GCGCGCGGAT CGCAACGTTG TCGCCGCTTA CGGTCGGCAG
GTCATTGATC CGCAGAGCGG CGCATTCGAG GCGCATGGCA ATGCTGAGTT GTTTCCGGCA
GATGCGCGCC AGCCGAAGAT CGTCGCATTC TCGAACGCCA ATAGCGCCAT TCGGCGCGAT
TATCTCTTGA TCCATCCCTT CAATCCGGCG ATCAAGATTC TGGAGGATCA CCTGTTCTAT
CTGGAGATCG CCGGGGATTT CGATGTGGTC TATGTGCCCG AAGCCGTCGT GCTCCATGAG
CATGATCGCT TTTCGTGGCG TTACTATGTG CGACGCTGGA TACTGGAGGG ATGGTCGTTC
TATTTTTTGA CGCGCCATCG CGGGTTGCCG TCGCCCTATA TTCCGCAACG GCTCGTTTCT
CTCCGCCGAT TGCTGTTCAT CTATCCGCGC ATCGCCGTCG CATATGCGCG GCGTGGTCGG
TGGGCGCCGG CGCTGCGCGC CATTCCCTTC TTCTGGCTGC GCGATCTGAT CTGGCTGGCG
AGTTTTGTGC GCGCCCGCCT GCTCCACCCG GTCATGGCGC AGACTGATAC GGCGCTGCTG
CTGCGCACCA ATCGGTTTTT GCGCCGTCAG GCGATGCAGC AGTCGCGCAT GACGCTTCCC
GACGCGCCGC TCGACTGGCG CGAGGAATGG CAACTCAAGG CGGATTGGGG GTTTATCCGG
CGGAATATTG CCGATTTCAT TCGTTTGTGC CACGAACGCG GGTTGTTTGC CAGTCCGCTG
CTGGAAGTCG GCGCATCGGG GCAGAACGAC TACCTGGCGG AGTGGTATGA CATGCGCACC
TCCAATCTGG CGTCGAACCT GCACAGCGCG GACATGGCGC TCGATATGGA GGATATGCGC
CAGATCGCCG ATAATTCGCT CGGTTCGATT CTCTGTTCAG AGGTGATCGA GCATGTGCGG
CATCCTGAGC GCGCGATTGC GGAAGCGTTT CGCGTGTTGC GTCCAGGCGG CACGCTGATC
ATTACAACGC CGTACAACAT TGTGATTCAC AACACCCCCG AAGATGGCGG CTTCCACGGG
CGCAACTTCA CGCCGCAGGG GTTAGAGTTG ATTCTGCGCG AGGCGGGGTT CGATATTGTG
CTGCTCGAAA CGCGCGGCGC TACTGAGATG CGCCGTCGTC TGATGCCGAG CAATGTGTTT
GCAGTGGCGC GGAAGCCGGG GTGA
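
The gene sequence is printed in space-separated blocks of 10 bases, 60 per row. A minimal sketch of how such text could be cleaned and its GC fraction computed is shown here; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used as input, so the printed fraction describes that row and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence from this record.
first_row = "ATGACCTCAC CGCTCGTTTC GATCATCATC CGGGCGCGGA ATGAAGCGCC TGCGCTGCGA"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_content(seq):.0%}")    # GC fraction of this row only
```

Passing all rows of the gene sequence to `clean_sequence` would give the value to compare against the GC content field.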
 
Protein sequence
MTSPLVSIII RARNEAPALR RLLPLLQSQE VDFPFDIWLL DNDSQDESAT LARAYGLRYH 
HIPRGGFNYA AALNLGASLA EGEFIVSLSA HCFPQRSDWL AALVAPLRAD RNVVAAYGRQ
VIDPQSGAFE AHGNAELFPA DARQPKIVAF SNANSAIRRD YLLIHPFNPA IKILEDHLFY
LEIAGDFDVV YVPEAVVLHE HDRFSWRYYV RRWILEGWSF YFLTRHRGLP SPYIPQRLVS
LRRLLFIYPR IAVAYARRGR WAPALRAIPF FWLRDLIWLA SFVRARLLHP VMAQTDTALL
LRTNRFLRRQ AMQQSRMTLP DAPLDWREEW QLKADWGFIR RNIADFIRLC HERGLFASPL
LEVGASGQND YLAEWYDMRT SNLASNLHSA DMALDMEDMR QIADNSLGSI LCSEVIEHVR
HPERAIAEAF RVLRPGGTLI ITTPYNIVIH NTPEDGGFHG RNFTPQGLEL ILREAGFDIV
LLETRGATEM RRRLMPSNVF AVARKPG
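
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough sketch, the snippet below builds the standard codon assignments, which table 11 shares for elongation codons, and translates two short fragments copied from this record. It is a simplification: table 11 also permits alternative start codons that are read as M at the first position, which this code does not handle (this gene starts with ATG, so the fragments here are unaffected). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA fragment; trailing stop codons are dropped."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First 30 bases and the final row of the gene sequence in this record.
print(translate("ATGACCTCAC CGCTCGTTTC GATCATCATC"))  # MTSPLVSIII
print(translate("GCAGTGGCGC GGAAGCCGGG GTGA"))        # AVARKPG
```

The two outputs match the first and last residues of the protein sequence listed above.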