Gene Rcas_4071 details


Gene Information

Locus tag: Rcas_4071
Symbol:
ID: 5541582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5284590
End bp: 5285717
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 62%
IMG OID: 640896183
Product: glycosyl transferase family protein
Protein accession: YP_001434121
Protein GI: 156743992
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase
TIGRFAM ID:
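
The coordinate and length fields above agree with each other, which a short sketch can confirm. It assumes that the Start bp and End bp coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Neither assumption is stated in the record itself. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5284590, 5285717)
print(length)                  # 1128
print(protein_length(length))  # 375
```

Both results match the Gene Length and Protein Length values listed in the record.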


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): 18
Fosmid unclonability p-value: 0.507876
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACTGC TGACCATTGT GGCGCTGCTG AGCGCCTTTA TGATCGCCTT CATCGTGACC 
GCGCTGACGG TTCCTCCGGT GATTCGCCTC TGCGAACGAC GCGGATGGAT GCAGCAGCCC
GGCGGTCGCC GCACCCATCC GCACCCTACT GCAAATGTGG GCGGCATCGC CATGTATGTT
GGCTTCGTAA CAGCGATCCT TGCCACATTC ATCTTTAGCG CACTCGACCC GGCGCTGCGC
CGCTCGGAGT TCGAGGTGTT GCGCATCGGG TTGCTCCTGA CCGGCGGAAC GCTCATTTTT
CTGGTCATGT GGCTCGATGA TGTGGTCGAG CTTCCATGGT TCCCCAAGTT TGCGGCACAG
ATTGGCGCAG CACTCATTGC AGTCGGTCCG TACCTCTGGG ACCAGCGGCG CTACCCCGAC
ACACTGGGGT TGCTCACCGA AGCGCGCGGC ATTCTGCTGA CCGCCTTCAA CGCGCCGTTC
GTCGGGCAGG TCAGCCTGTG GGATGTCAGC CCATGGCTGG CAATCCTGGC AACGGTCTTC
TGGCTTGGCT GGATGGCCAA TACCATCAAC TGGTCGGACG GTCTTGATGG TCTGGCGGCT
GGCGTGTCGC TGATCGCGGC GTTTATGCTG GCGCTTCATG CACTCAGGCT CGACCCGCCG
CAAACAACGA TTGCGCTGCT GCCACTCGCG CTCGCCGGGA CATGCGCCGG ATTTCTGATC
TTCAACCTTC CCCCGGCACG GATTTTCATG GGTGACAGCG GTGCAGAGTT TCTCGGTTTC
ATCCTTGGCG TCAGCGCGAT CATCGGCGGG GCGAAACTGG CGACGGTGCT CCTGGTGCTG
GGTGTGCCGA TCCTCGATGT CGCATGGCTG ATCGTGGCAC GCACGGTCAG CGGCAAACAA
CCGATGCGTG GCGGACGTGA TCACCTGCAT CATCGCCTGC TGGACGGCGG CATGTCGCCA
CGCCAGATTC TGGCGCTTTA CTGGGGGCTG AGCGCAGGCT TTGGGTTGCT CGGCATCACC
GACATTAGCC CCTACGCCAA GTTGATCGGT CTGGCGCTGC TGATTCTGAT CGGCATCGGA
TTGATCGCTT ATGCCACACG CCGGGTGACG CTGCGATCGG TGCAGTAA
 
Protein sequence
MSLLTIVALL SAFMIAFIVT ALTVPPVIRL CERRGWMQQP GGRRTHPHPT ANVGGIAMYV 
GFVTAILATF IFSALDPALR RSEFEVLRIG LLLTGGTLIF LVMWLDDVVE LPWFPKFAAQ
IGAALIAVGP YLWDQRRYPD TLGLLTEARG ILLTAFNAPF VGQVSLWDVS PWLAILATVF
WLGWMANTIN WSDGLDGLAA GVSLIAAFML ALHALRLDPP QTTIALLPLA LAGTCAGFLI
FNLPPARIFM GDSGAEFLGF ILGVSAIIGG AKLATVLLVL GVPILDVAWL IVARTVSGKQ
PMRGGRDHLH HRLLDGGMSP RQILALYWGL SAGFGLLGIT DISPYAKLIG LALLILIGIG
LIAYATRRVT LRSVQ
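
The two listings above are related by translation, and the following is a minimal sketch of that step rather than the database's own method. It uses the standard codon assignments, which translation table 11 shares for amino acids (the two tables differ in their permitted start codons). The helper names `clean` and `translate` are hypothetical. Only the first 30 bases and the final line of the gene sequence are used as input.

```python
from itertools import product

# Codon order T, C, A, G at each position, paired with the standard
# amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate(clean("ATGTCACTGC TGACCATTGT GGCGCTGCTG")))  # MSLLTIVALL
# Final line of the gene sequence, which begins on a codon boundary.
print(translate(clean("TTGATCGCTT ATGCCACACG CCGGGTGACG CTGCGATCGG TGCAGTAA")))  # LIAYATRRVTLRSVQ
```

The outputs match the first ten and last fifteen residues of the protein sequence listed above.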