Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0208 |
Symbol | |
ID | 5537669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 256814 |
End bp | 258535 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640892371 |
Product | hypothetical protein |
Protein accession | YP_001430359 |
Protein GI | 156740230 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.212991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.715673 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAGAA CATCTGTAGG ACGTTTTTCT CTTGCACGCT ACTGGCCAGA GGTGGTCGCA TTCGTGATAC CTCTACTCGT TGCCAGTGCA TTGCTTTTTG GATATGTTGG ACCACTCAAC GGGTTCTGGT CTACCGATCA GGGGGTGAAG CTGATTCAGG TGCAGAGCCT TCTGTTGAAC AAATTCAGCA GTAGTGCGCT GATCTACCCC GGTGCGGCGA TCGATCAGGA TGACAGGGTC AGTCCGTTAC GAGGGCAATA TTATCAGCAC GGCGGGCAGA CCTATGCAAT GTTCTCGGAC GCCTTCGCTC TGATCAGCAG CGTTCCCTTC TTTTTCTTTG GGTTTGCCGG GTTATACCTG GTTCCACTGG TTTCTCTCGC TGCGCTGTCT TTGATCTGCA CTTGCATCGC TCGTCCGTTG CTTGGTCCGG GAGGGGCGTT GCTCTCGGCT CTGGCGCTGA CGTTGACATC GCCTTTGTTG TTTTACAGCG TTATTTTCTG GGAACATCTG CCAGCGATGT TGCTGACAAC CCTTGCTCTC TGGCAGTCGT TGGAAGGGTA TCATCGCAAT GAGCGGCGCC GGTTCTTTGG CGCCGGCATC GCTATTGGCG CCGCAGCGTG GTTGCGCAAC GAAGCCATAC TGGCAGCGCC GGCACTGATC GGTGCGGTTC TGCTTACACG GCGCTCGCAG TCGTTGCAAA CTGCTGCCTG GATCGGCATC GGTGCGGGTG CTGGAGTGTT GCCCTTGTTG CTCTACAACC AGATCGTGTT TGGCGCGGCG GTCGGTCCGC ATGTGCTGGT TGCGGGAGCG GCACAATATC GGGGAGCGAG TGATCCATTG ATGATGCGCA TAGAATGGGC TGATCGCCTG CTTGTACCAC TTGATGAACC GGTTCTGGCG GGCTTGGTGA TTGTACTGGT CATAGTCTCC ATTATCACGG CGATATGGCG TGCGCGCAGC GCGGCGAATG TCGGCTTCGC GCTCGCCATT GTCGTGACGA TTGTGATCGC AGCTGCCATT CAACGAGCGC CGCGCGGCGG GTTGCAAACA ACGCTGCTGA TGACATTTCC GGCAGTACTG CTGTGTTGCT TCCCGGTTGC CCCGAAGGAG CGATTAGATC GTGTTGACAC ACCTTTGATC CTGACCGCAT TTGGGTTGAC GTTTATCGCG CTGGCCTGGC TGGCGCTGCT GCCTGATGGT GGTGCGCAGT GGGGTCCACG CATGCTTTTG CCGGCGGCGC CGGCGCTGAC AATCGCAGGC TTCTGGCGCG CAGGGTCATG GCTTCGCCGC CCTGCTGCTG GCGCTGCGGT TGCGGGCGTG AGCGCCATTG TCCTCTTTGT CGCCATGTTG TCGCAATGTG CGGGATTGCG CCAGTTGCGC GATTTCAATA CGGCGAACCA TACCTTGCTC ACAACAGTCG CGCAGAGCGG AGCATCAGCG ATTATCACCG ATACGTGGTA CGGTCCGCCG CTGCTGGCGC CGATCTTCTA TGACGAACGC ATGATTTTTC TGGTTGATGA TGGCGCCGAT CTTGATTACC TTATCGAGCG GTTGAGCAAC GCAGGGTTCG ATACGGTCTA CTATTTGAGC GGGCGGCGTG ATGAAATTAC TTCCGATGCT CGACGATGGC GCGAACTGAC ACCGATCGGA GCGCCGGCTC GATTGGCGCA TCATCTTACC GGACAACTGT ACCAGATCAA TACGCCTCCA CCATCAGACT GA
|
Protein sequence | MGRTSVGRFS LARYWPEVVA FVIPLLVASA LLFGYVGPLN GFWSTDQGVK LIQVQSLLLN KFSSSALIYP GAAIDQDDRV SPLRGQYYQH GGQTYAMFSD AFALISSVPF FFFGFAGLYL VPLVSLAALS LICTCIARPL LGPGGALLSA LALTLTSPLL FYSVIFWEHL PAMLLTTLAL WQSLEGYHRN ERRRFFGAGI AIGAAAWLRN EAILAAPALI GAVLLTRRSQ SLQTAAWIGI GAGAGVLPLL LYNQIVFGAA VGPHVLVAGA AQYRGASDPL MMRIEWADRL LVPLDEPVLA GLVIVLVIVS IITAIWRARS AANVGFALAI VVTIVIAAAI QRAPRGGLQT TLLMTFPAVL LCCFPVAPKE RLDRVDTPLI LTAFGLTFIA LAWLALLPDG GAQWGPRMLL PAAPALTIAG FWRAGSWLRR PAAGAAVAGV SAIVLFVAML SQCAGLRQLR DFNTANHTLL TTVAQSGASA IITDTWYGPP LLAPIFYDER MIFLVDDGAD LDYLIERLSN AGFDTVYYLS GRRDEITSDA RRWRELTPIG APARLAHHLT GQLYQINTPP PSD
|
| |