Gene Rcas_0017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0017 
Symbol 
ID5537474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp17324 
End bp18325 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content61% 
IMG OID640892182 
Productglycosyl transferase family protein 
Protein accessionYP_001430174 
Protein GI156740045 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000233702 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCCAAA TCCTCCTGAT TTTCATTACT GCCCTGCTCT TTTCGGTGTT GGCGACGCCG 
GTAGCGCGGC GGGTGGCATT GCGCACCGGC GTGGTCGATG CGCCGGCGGC ACGCAAACTG
CACCTCGCGC CGGTGCCGCT GCTCGGCGGC GGCGCGATCT ACACAGCGTT TGTCGTGGCG
CTCATCCTGT TTGGCGATCA GGCATATGTG CGTGAGTTGA TCGGCATTCT GCTCGGCGCT
ACGCTGGTCT CGCTCTTTGG TCTTGCCGAT GACCGCTGGG GATTGAACGC CTATGTGAAA
CTTGGCGCTC AGGCGCTGGC GGGTGCGATC CTGATCCTTG GCGGAACCCA GGTGCGTCTC
TTCCCGGTTG AATGGATGAA CTGGGCGATA ACACTGTTCT GGGTAGTAGG TATCACGAAT
GCGCTCAATC TGCTCGACAA TATGGATGGG CTTTCGGCAG GAGTGACGAC GGTTGCTGCC
GCTTATTTCC TGCTCCTGGC TGCGATGAGC GGACAATACC TGGTCGGAGC GATGGCTGCT
GCGCTGATCG GTGCGTGTGT TGGCTTTCTG CGCTACAATC TCAACCCGGC GACGATTTTC
ATGGGTGATG CCGGTTCGCT CTTCCTGGGG TTTCTCCTGG CGGCGCTGGC GATCAAATTG
CGCTTCCCGT CCAATGTTCC ATGGGTGACA TGGCTGGTGC CGGTGTGTGT CCTGGCAGTG
CCGATCTTCG ACACGTCACT GGTTTTCGTC TCTCGTCTCC GCCGGGGCAA AAACCCATTG
ACCACACCGG GGAAGGATCA TGTATCGCAC CGCCTCACTG CCCTCGGTCT GACTCGTCGT
GAAGCGGTGC TGATCTGTTA CCTGCTGGGA TGCGGCGCGG GAATGGTGGC GGTGTACATC
TCGCAGGCGC GCGCGCCTGA TGGGTATGTT GCCGCAGGAT TGCTCGCTGC GGCAATGCTG
GCGGGAATCG TCTGGTTCGA GCGACGTCAG GGCGGGGGAT AG
 
Protein sequence
MTQILLIFIT ALLFSVLATP VARRVALRTG VVDAPAARKL HLAPVPLLGG GAIYTAFVVA 
LILFGDQAYV RELIGILLGA TLVSLFGLAD DRWGLNAYVK LGAQALAGAI LILGGTQVRL
FPVEWMNWAI TLFWVVGITN ALNLLDNMDG LSAGVTTVAA AYFLLLAAMS GQYLVGAMAA
ALIGACVGFL RYNLNPATIF MGDAGSLFLG FLLAALAIKL RFPSNVPWVT WLVPVCVLAV
PIFDTSLVFV SRLRRGKNPL TTPGKDHVSH RLTALGLTRR EAVLICYLLG CGAGMVAVYI
SQARAPDGYV AAGLLAAAML AGIVWFERRQ GGG