Gene Rcas_0787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0787 
Symbol 
ID5538253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1029313 
End bp1030881 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content57% 
IMG OID640892939 
Producthypothetical protein 
Protein accessionYP_001430922 
Protein GI156740793 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.421707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCACT CAATATCTTC CATCAAAGAA AGCGAAACAA CCATCGCTCA ACGACGCGCG 
CATCGAATAG CATCCTGGAG TGCGTTGAGC ATCGTACTTT TAGCGTTGAG CCTCTGGAAT
CTCGATGGAC CAGCGATGTG GTGGGACGAG GGATGGACCC TATCGGTTGC GAGAAATTGG
GCGGAGCAGG GGCACTACGG ACGCCTGCGC AATGGTCAGC GAGCGCGCCC TGGTCTGGAG
GCGGCTTTCA CTACAACGTT GCCGGTTGGA ATGATGATGC GCGCATTTGG TGTGGGTCTC
TGGCAGGGAC GACTCTTCGG AGCACTCTGT GCTGTGGCAG TTGTCCTGCT GTTGGCGGCG
CTCGCAGCCA GATTGTATGA CCGGCGCGTC GCGGTTGCGA CAGTCGTTGC CGCCCTGTGT
ATGACCGCAT TTCCGACAAT CCATCCATTG CTGCTGGGAC GCCGGGTGCT AGCAGAGATA
CCGATGCTAA TGTATCTGCT GGTCGGATAT CTGTTCCTAT GGCGTGCGCT TGTCAATCGA
TGGGTCGCAC TCTTCCCGGC GGCATTGTTT CTCGCGCTAG CCTGGGTGAG CAAAGCGCAA
CTCTCACCTT TCCTGATTGT ATCGCTGACA ATGTCCGCGC TAGTCGCCGC GCTGATGCGC
CGATGGCGCA TTGCCGCTCT TTTCACTCTC GTTGCAGGCG GAACAGTGCT TGGCGCCAGG
ATCCTTCAGC AATCGGTCTA TCCGATTCTG ATCGACGCTC AACTGCCGCC AGACCCAACA
ACAGGACTGA TTGAAACCGT TGCCATCGTG ACCGCCCCCG CTCGTCGCCT CGATGCCATC
CAGAATCTCG CCATCTTTGG GCTTCCTGCT CTTTGTGGCA TGCTCTGGGG GACGTGGCGA
CTGTGGCATG ATCGCTCTGC AGCCAGCAGC GGCGCGCCGG TCTGGTATAC TCGCCTGACC
TTGCTCGCAC TATGCGGCAG CTGGCTAGCA TGGTACCTCG TGTTTTCCAT CGGATGGGTG
CGTTACATGG GGCCTGCCAT TATTGTCGCG AGCATTTTTG TAGCTGACCT TCTAGCAAAC
GCTACCGATG GCTTTGCCAT TCGGCATAGT CTCGTGTCGC TAATCAACCT TCTCACGTTA
CGCCGGTGGA CACGGACGGG CGGAGCGGCG TTGTTCGGCA CAGTGCTCGT ACTCTGGGGA
GGAACGTTGA CAGCGGTAAG CGTTGCCGCC ACCTATCCGG TACGCGACTA TTCCGCTCAG
CGCGTGGCGC AGTGGCTCAA TGCGCAACCG GAAGGAACGA GGATCGAAAC GTATGAGACT
GAGCTTCACT TTTTGCTTGA TCAACCCTAT ACTTTTCCGC CAGACCAGGT GCATGTTGCG
TTGCTGCGAC GCCTCTGGGA AATAGACGAT AACGTGCTGA TCGCCTATGA TCCGATGGTC
AACGACCCGG ATTTCCTGGT AGAGGGCGGA ACAAGCGTTG CCAAACTGTA TGAGTCGACA
CTGGCAAGCG GACGATTCCG TCTCGTGCTG GAAGATGGAC CTTACCGTGT ATTCGAGCGA
GTGCGCTAA
 
Protein sequence
MDHSISSIKE SETTIAQRRA HRIASWSALS IVLLALSLWN LDGPAMWWDE GWTLSVARNW 
AEQGHYGRLR NGQRARPGLE AAFTTTLPVG MMMRAFGVGL WQGRLFGALC AVAVVLLLAA
LAARLYDRRV AVATVVAALC MTAFPTIHPL LLGRRVLAEI PMLMYLLVGY LFLWRALVNR
WVALFPAALF LALAWVSKAQ LSPFLIVSLT MSALVAALMR RWRIAALFTL VAGGTVLGAR
ILQQSVYPIL IDAQLPPDPT TGLIETVAIV TAPARRLDAI QNLAIFGLPA LCGMLWGTWR
LWHDRSAASS GAPVWYTRLT LLALCGSWLA WYLVFSIGWV RYMGPAIIVA SIFVADLLAN
ATDGFAIRHS LVSLINLLTL RRWTRTGGAA LFGTVLVLWG GTLTAVSVAA TYPVRDYSAQ
RVAQWLNAQP EGTRIETYET ELHFLLDQPY TFPPDQVHVA LLRRLWEIDD NVLIAYDPMV
NDPDFLVEGG TSVAKLYEST LASGRFRLVL EDGPYRVFER VR