Gene Rcas_3522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3522 
Symbol 
ID5541021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4588472 
End bp4589629 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content63% 
IMG OID640895640 
Productglycosyl transferase group 1 
Protein accessionYP_001433590 
Protein GI156743461 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.828408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTCG CTATCAATGC GCATCTTCTT GCCCATACGC GCACCTTTCG GCGCGCCGGG 
GTATCGAACT ATGTTGAGGC GCTGCTCATT CATCTGGGAG CGATCGACCG TCAGAACCAG
TACACGGTAT ATACGACGCG CGGCCTCGAC AATGGAGCGC TCGGATTGCC GTCCAACTTC
TATGTGCAGC CCAGCCGCCT CCCGACGATC AATCCGCGCG TTCGCATTCC GTGGGAGCAA
CTGTTGGCGC CGGCGCTGCT TCGCTTCGCT CGCGCCGATG TGTACCATGG GGTGCTCAAC
GTTATGCCCC TCGTCTGCCC GGTTCCATCG GTCGTCACTA TTCACGATCT GAGCGCGTTT
CTGTTTCCGC AGACCTTTCG CCGCGTCAAC CGGATCTATA CCCGATGGGC GATCCGGGTT
GCCGCGCGCC GCGCCCGCTA TCTTCTGGCG GTGTCGGAGT TTACCCGGCG CGAGATTGTG
CGCTGGCTCC ACGTTCCGCC AGAGCGCGTA GTGGTGACGC CCAACGCTGC GGATGCACGC
TTCGCGCCTC CCGATCCAAC CACGCTCGAG GCGTTTCGTC GCCGTGCCGG GTTGCCCGAC
CGGTTTGTGT TGTTCCTCGG CACGCTTGAG CCGCGCAAGA ATCTGACACT GCTCCTGGAA
GCATATGCCC GGATTGTGCG CGATGTTGAT GCGCCGCTGA TCATCGGCGG CGCAAAGGGA
TGGCTGTACG AGCCGATCCT GGCGCGCGCT GAACAACTTG GACTCGGCGA CCGGCTCCGC
TTCGTCGGGT ATATCGACCA GGAGGATCAA GCGCTCTGGT ATGCGGCGGC TACGATCTTT
GTCTTTCCGT CGTTGTATGA AGGGTTTGGC ATGCCGCCGC TCGAGGCGAT GGCATGCGGC
ACGCCGGTGA TCGTCAGCAG TAGCAGTAGC CTCCCCGAAG TGGTGGGGGC GATTGATGGA
CATCCCGATC AGGCGGCAGC GCTCATTGTG CCGCCAACCG ACGCCGATGC GCTGGCGGAG
GCGATGTTGC GACTGCTCTC CGATGCAGAG TTGCGCGCCG AGTTGCGCGC CCGCGGGCTT
GCGCGCGCGC GTTGCTTCTC CTGGCGCACG ACGGCGGAGC GAACGCTGGA GGTGTACCGG
CAAGCAGCAT GTGGGTGA
 
Protein sequence
MHVAINAHLL AHTRTFRRAG VSNYVEALLI HLGAIDRQNQ YTVYTTRGLD NGALGLPSNF 
YVQPSRLPTI NPRVRIPWEQ LLAPALLRFA RADVYHGVLN VMPLVCPVPS VVTIHDLSAF
LFPQTFRRVN RIYTRWAIRV AARRARYLLA VSEFTRREIV RWLHVPPERV VVTPNAADAR
FAPPDPTTLE AFRRRAGLPD RFVLFLGTLE PRKNLTLLLE AYARIVRDVD APLIIGGAKG
WLYEPILARA EQLGLGDRLR FVGYIDQEDQ ALWYAAATIF VFPSLYEGFG MPPLEAMACG
TPVIVSSSSS LPEVVGAIDG HPDQAAALIV PPTDADALAE AMLRLLSDAE LRAELRARGL
ARARCFSWRT TAERTLEVYR QAACG