Gene Information |
Locus tag | Rcas_3899 |
Symbol | |
ID | 5541405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5103027 |
End bp | 5104226 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640896010 |
Product | hypothetical protein |
Protein accession | YP_001433953 |
Protein GI | 156743824 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00043819 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.779537 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATATCG GTGATATGCT CGAACGTATG CGCGACCCGC TGGGTGCGCC GTTTTATCCG CCGGCATTGC AGGTCTTGCT GGTCGTCACG TGGGTGTTCC ATATCTTCTT CGTCACCCTG GCGCTTGGAT CGAGCGCATT TTCGATCCTG GGATTCCTGC GACCGAATGA GTATCGCCTG CGCCTGGCGC GCGTCGCGGC GCGGTTGACG CCCAACGCCG TCGGTCTCGG CATTGTGACG GGAATTGCGC CGCTGCTCTT CGTGCAAACG ATCTACGATC CTATCTGGTA TGCGAGCAAT ACGCTGAGCG GCTTCTGGTC GGTGAGTTTC ATTTTTGTGG TGATGGGCGG GTACAGCCTG GCGTACCTCT TCTATCTCAA AGGGAGCGCT GACGGAAGGC TGCTCTGGTC GGCGGTCGCA TCGTTCATCT TGCTCTTCTT TGCCGGATGG GTGATGCACG TGCTGGCATC GGTGTCGATT CGCCCAGAGC GCTGGATGGA GTGGTATGCC CCAGGTGGGG TGATCGACAC GCGCGGCGTT GTGTTCCATT CCTGGAATAT CCCGCGCCTG GCGTTCCTTC TGCCGCTCCA GGCCGGGTTG AGCCTGGCAG TGGTGCTGAC GTTGTTCGCC TGGTACTTTC GGCGAATCGA AGAGGATGCG CCGTTTATGA CGTGGGTCGC CGATCGAGGG CGATGGCTGG GTCTGGCGGT CAGCCCGCTG TATGCGCTCG CCGGGCTGCT CTGGGCAGCG ACAGAAGGCG CCGAGTTTGG GGTTGGCATG CCGGTTGGCA TCGCGCTCGC TGCCGTCGGT CTGGCATTGA CCGGCTATTT CTTCTTTCTG AAGCAGCCAA TGCAGCACGC ACCGCGCACA CTGCTGGTCT GGATCATCGC CCTGGTTGTC GTCGGCATTG TGCGTGAAGG TATTCGCGCC GTCTCACTCG CGCGCTTCGG GTATAGCGTC TCCAACTATC CATACATGAT GGATTGGGGA TCGATCCTGG TGTTTGGAGT AACAACCGTG GTTGGCGTTG CCGTAGTGAC GTACCTGGCG CTGGTGCTGT ATCAGTCGGG CGGAACGAAG CGTGATGCCC AGGTTTCGCC GCGGGTTGAA CGTCTTGGCT CGATTGCAAC CGGTATGCTG GGCGCGTGGT TTGCCTTCTT CATCCTGGTG GGGCTGTATA CTACGTTCTT CCTCAAGTAA
|
Protein sequence | MDIGDMLERM RDPLGAPFYP PALQVLLVVT WVFHIFFVTL ALGSSAFSIL GFLRPNEYRL RLARVAARLT PNAVGLGIVT GIAPLLFVQT IYDPIWYASN TLSGFWSVSF IFVVMGGYSL AYLFYLKGSA DGRLLWSAVA SFILLFFAGW VMHVLASVSI RPERWMEWYA PGGVIDTRGV VFHSWNIPRL AFLLPLQAGL SLAVVLTLFA WYFRRIEEDA PFMTWVADRG RWLGLAVSPL YALAGLLWAA TEGAEFGVGM PVGIALAAVG LALTGYFFFL KQPMQHAPRT LLVWIIALVV VGIVREGIRA VSLARFGYSV SNYPYMMDWG SILVFGVTTV VGVAVVTYLA LVLYQSGGTK RDAQVSPRVE RLGSIATGML GAWFAFFILV GLYTTFFLK
|
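The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes the start and end positions are 1-based and inclusive (which matches the listed gene length of 1200 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The helper names `gene_length`, `protein_length`, and `gc_percent` are hypothetical.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 5103027
END_BP = 5104226


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))
```

Passing the gene sequence string from the record to `gc_percent` gives a value that can be compared with the rounded GC content field. The function strips the spaces between the 10-base blocks before counting.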