Gene Rcas_2055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2055 
Symbol 
ID5539535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2635606 
End bp2637795 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content54% 
IMG OID640894191 
Productglycosyl transferase family protein 
Protein accessionYP_001432160 
Protein GI156742031 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.265911 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACCG CTACCATCAC ACAATGGATC ATGCGGCAAG TAACTTTTGT ACGTCCGATG 
ATTCGCCCAA TCCTCACGAC GCTTGCACTG CTCACCGCGC TCCTGTTTGT CGCATCAACC
ACACCGCATT CTATTGCATA CCCTGCCGAA GGCATCGGTT TCAGCGTCGA TGGTTTCTGG
CAACCGGAAT TCAACGCTGA GCGCTCATAT CGCTGGACGA GAGGCATAAC GCGCGTGCGC
ATTCCAGGAT TTGAAGATGC ATCGCTCCTC CTGGTTTCGC TCCAACTCTC CGCGCCGCAA
CAATCAGGTG CAACGCATAC GCCTGCAACT GTGGCAACGG ACAAGAGTGC GCCAATGCGC
TTCAGTATTG CTCCCGAATG GCGCACCTAT CATTTGCTGA TCGATGCGCA ATCCCCAGAT
TGGCGCATTC CTGCGTTGAT GATCACAAAC CCTACGTGGC GTCCACGCGG AGATGAACGT
GATCTAGGCA TCGTGTTCGG TCATATGAAT GCGCAGCGCC TTCTTCCATC TTATACCGCT
GCTATTGTTG AGCGCTGGAT GTTTCTTGCG TCGCTGGTCG CTCTGACAAC GGTTGCCACG
CGATGCGCAC CGCGATGGAA TCTTCTGCCA GGCATTCTGA CGGCGCTGCT GGTGATGGGT
TCGCTATGGG CGCCGGTTCG TCTAAGTCAG GCATTGCCGA CGAACTGGCA GATCGTCATC
GGCATAGCAA TCGTTACGTT GTATATCGAA GGCTTCCGTC GCTACCGACA ACGTATGCCA
AACAGCATTT TTCCACTCAT CGGGGTTGCT GGCGTCACCC TCGGCACCGG GATTGTTTGG
AGGGGATGGG CAATTGTCGG AGCAGCAGTG ATGATATGTA GCGCCCTCCT ATCCGTGTAT
ACGCGCACAT TGCGCTCCGA TTCAGGTAAG CGTCAGTCTG CGGATCGGTC ATCGCTCCGC
CGCCTGTGGC TGAGTGGTCT GGCGATTGCA GTTTTCGCCG GCGGCGTGCT TATTCCGAAC
AATAACGACG TATATCACGG TGATGAGAAC TTTTGGGCGA CCACCGGATT GCGCGCTTTT
CGTCTCGCAT TCATCGAACG TGATATATAT CACCCTTTCT GGACTGAGCC TTTAAGATCT
TCTCTGTGGG CATATTCGCC GATATTCGCA CTGCCACATC CGCAGATCGG CAAATACTTG
TTTGGAGCAG GGTTATATCT TGCAGGGCAC ACTGACCCGC TAATACGAGG ATATGACTTC
TCTCGAAGTC TGGCTTGGAA CAAAGCGCAC AATCGCGTAA TATCTCCTGA TGTTGCCAGC
GCTGCCCGTT TCCTTGTCAC ATTGACAGCA ATTGCGAGCA CTGTGCTCGT CTACTGGATC
GGGGTTCGTC TCGGCGGCGT CGCCGTCGGC GTGCTGGCTG CTGCGCTGAT GAACGCTCCG
GAGTCGATGC GTTATCACGG AAAGATCATC ATGCTTGATG TTCCCGCATT GATATTCGGG
TTGCTTGCTC TGGTGATATG CGCTGAGATG CTGCGTAGTT GGCGCCAGGG AAATCGGCGC
GCCATCTTCT TGACCATTGC GTGCGGCGTA GTCTGCGGTC TGGGATTAGG AACAAAACTC
AATGCCGCTC TTGTCGTCCT AACCTGCCTC TTTGTGTCCG TCATCTTCGC CGTCTGGCGC
AGTCGCCGCT TCGCAGTGAC GCCGTTTGTG CCAAAGCGCG CCGTCGTCGC CATTGCTCTG
TGCGCCTGGA CGGTATTCTT TCTCTCCAAT CCTGCGCTCT ACCGGCAACC GGTCGAAGGC
ATCCGGCGGA TGCTGGACTT TGGTACTGAT ATAGGGTGCA ATCCAGAATT GTCCTGGTGT
CATCCCTTAC CAACACCAAC TGATCGGATG AGGGCTATCT GGTTTTTTCT AAGTGATGAA
GGCGAAGTTA ATGCTGGCGG TCTGCCTGGC AGTCACCTGT TATTGATCAT TGGCGCCGCA
GCATTGGCGG TCAGACTGTC GCGCTGGACG GGAAGCAACG ATGTTGAAAT TGTCTTAATC
AGCGCATGGA TAATTATCAC CCTCGCAGGA TTGATGCTCT GGCTACCCAT TGGCATTTTT
CGCTATGTTA TGCCGCTTGT TCCGATTTCG ATGCTTCTTC AGTCGTATGG TATAATTGGA
ATTCTGCGAA GCATCACACG CACGGATTGA
 
Protein sequence
MQTATITQWI MRQVTFVRPM IRPILTTLAL LTALLFVAST TPHSIAYPAE GIGFSVDGFW 
QPEFNAERSY RWTRGITRVR IPGFEDASLL LVSLQLSAPQ QSGATHTPAT VATDKSAPMR
FSIAPEWRTY HLLIDAQSPD WRIPALMITN PTWRPRGDER DLGIVFGHMN AQRLLPSYTA
AIVERWMFLA SLVALTTVAT RCAPRWNLLP GILTALLVMG SLWAPVRLSQ ALPTNWQIVI
GIAIVTLYIE GFRRYRQRMP NSIFPLIGVA GVTLGTGIVW RGWAIVGAAV MICSALLSVY
TRTLRSDSGK RQSADRSSLR RLWLSGLAIA VFAGGVLIPN NNDVYHGDEN FWATTGLRAF
RLAFIERDIY HPFWTEPLRS SLWAYSPIFA LPHPQIGKYL FGAGLYLAGH TDPLIRGYDF
SRSLAWNKAH NRVISPDVAS AARFLVTLTA IASTVLVYWI GVRLGGVAVG VLAAALMNAP
ESMRYHGKII MLDVPALIFG LLALVICAEM LRSWRQGNRR AIFLTIACGV VCGLGLGTKL
NAALVVLTCL FVSVIFAVWR SRRFAVTPFV PKRAVVAIAL CAWTVFFLSN PALYRQPVEG
IRRMLDFGTD IGCNPELSWC HPLPTPTDRM RAIWFFLSDE GEVNAGGLPG SHLLLIIGAA
ALAVRLSRWT GSNDVEIVLI SAWIIITLAG LMLWLPIGIF RYVMPLVPIS MLLQSYGIIG
ILRSITRTD