Gene Rcas_0208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0208 
Symbol 
ID5537669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp256814 
End bp258535 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content57% 
IMG OID640892371 
Producthypothetical protein 
Protein accessionYP_001430359 
Protein GI156740230 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.212991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.715673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAGAA CATCTGTAGG ACGTTTTTCT CTTGCACGCT ACTGGCCAGA GGTGGTCGCA 
TTCGTGATAC CTCTACTCGT TGCCAGTGCA TTGCTTTTTG GATATGTTGG ACCACTCAAC
GGGTTCTGGT CTACCGATCA GGGGGTGAAG CTGATTCAGG TGCAGAGCCT TCTGTTGAAC
AAATTCAGCA GTAGTGCGCT GATCTACCCC GGTGCGGCGA TCGATCAGGA TGACAGGGTC
AGTCCGTTAC GAGGGCAATA TTATCAGCAC GGCGGGCAGA CCTATGCAAT GTTCTCGGAC
GCCTTCGCTC TGATCAGCAG CGTTCCCTTC TTTTTCTTTG GGTTTGCCGG GTTATACCTG
GTTCCACTGG TTTCTCTCGC TGCGCTGTCT TTGATCTGCA CTTGCATCGC TCGTCCGTTG
CTTGGTCCGG GAGGGGCGTT GCTCTCGGCT CTGGCGCTGA CGTTGACATC GCCTTTGTTG
TTTTACAGCG TTATTTTCTG GGAACATCTG CCAGCGATGT TGCTGACAAC CCTTGCTCTC
TGGCAGTCGT TGGAAGGGTA TCATCGCAAT GAGCGGCGCC GGTTCTTTGG CGCCGGCATC
GCTATTGGCG CCGCAGCGTG GTTGCGCAAC GAAGCCATAC TGGCAGCGCC GGCACTGATC
GGTGCGGTTC TGCTTACACG GCGCTCGCAG TCGTTGCAAA CTGCTGCCTG GATCGGCATC
GGTGCGGGTG CTGGAGTGTT GCCCTTGTTG CTCTACAACC AGATCGTGTT TGGCGCGGCG
GTCGGTCCGC ATGTGCTGGT TGCGGGAGCG GCACAATATC GGGGAGCGAG TGATCCATTG
ATGATGCGCA TAGAATGGGC TGATCGCCTG CTTGTACCAC TTGATGAACC GGTTCTGGCG
GGCTTGGTGA TTGTACTGGT CATAGTCTCC ATTATCACGG CGATATGGCG TGCGCGCAGC
GCGGCGAATG TCGGCTTCGC GCTCGCCATT GTCGTGACGA TTGTGATCGC AGCTGCCATT
CAACGAGCGC CGCGCGGCGG GTTGCAAACA ACGCTGCTGA TGACATTTCC GGCAGTACTG
CTGTGTTGCT TCCCGGTTGC CCCGAAGGAG CGATTAGATC GTGTTGACAC ACCTTTGATC
CTGACCGCAT TTGGGTTGAC GTTTATCGCG CTGGCCTGGC TGGCGCTGCT GCCTGATGGT
GGTGCGCAGT GGGGTCCACG CATGCTTTTG CCGGCGGCGC CGGCGCTGAC AATCGCAGGC
TTCTGGCGCG CAGGGTCATG GCTTCGCCGC CCTGCTGCTG GCGCTGCGGT TGCGGGCGTG
AGCGCCATTG TCCTCTTTGT CGCCATGTTG TCGCAATGTG CGGGATTGCG CCAGTTGCGC
GATTTCAATA CGGCGAACCA TACCTTGCTC ACAACAGTCG CGCAGAGCGG AGCATCAGCG
ATTATCACCG ATACGTGGTA CGGTCCGCCG CTGCTGGCGC CGATCTTCTA TGACGAACGC
ATGATTTTTC TGGTTGATGA TGGCGCCGAT CTTGATTACC TTATCGAGCG GTTGAGCAAC
GCAGGGTTCG ATACGGTCTA CTATTTGAGC GGGCGGCGTG ATGAAATTAC TTCCGATGCT
CGACGATGGC GCGAACTGAC ACCGATCGGA GCGCCGGCTC GATTGGCGCA TCATCTTACC
GGACAACTGT ACCAGATCAA TACGCCTCCA CCATCAGACT GA
 
Protein sequence
MGRTSVGRFS LARYWPEVVA FVIPLLVASA LLFGYVGPLN GFWSTDQGVK LIQVQSLLLN 
KFSSSALIYP GAAIDQDDRV SPLRGQYYQH GGQTYAMFSD AFALISSVPF FFFGFAGLYL
VPLVSLAALS LICTCIARPL LGPGGALLSA LALTLTSPLL FYSVIFWEHL PAMLLTTLAL
WQSLEGYHRN ERRRFFGAGI AIGAAAWLRN EAILAAPALI GAVLLTRRSQ SLQTAAWIGI
GAGAGVLPLL LYNQIVFGAA VGPHVLVAGA AQYRGASDPL MMRIEWADRL LVPLDEPVLA
GLVIVLVIVS IITAIWRARS AANVGFALAI VVTIVIAAAI QRAPRGGLQT TLLMTFPAVL
LCCFPVAPKE RLDRVDTPLI LTAFGLTFIA LAWLALLPDG GAQWGPRMLL PAAPALTIAG
FWRAGSWLRR PAAGAAVAGV SAIVLFVAML SQCAGLRQLR DFNTANHTLL TTVAQSGASA
IITDTWYGPP LLAPIFYDER MIFLVDDGAD LDYLIERLSN AGFDTVYYLS GRRDEITSDA
RRWRELTPIG APARLAHHLT GQLYQINTPP PSD