Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1501 |
Symbol | |
ID | 5538976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1916983 |
End bp | 1918545 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640893639 |
Product | hypothetical protein |
Protein accession | YP_001431613 |
Protein GI | 156741484 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.53215 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.500126 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCACG ATGGACAGCA CGAGGAAGGA AACCGGTCGT TTCCCGGCGC TCTGGCGCTC ATACTCACGG GTCATATGCC ATACCTGCGC GCTGCCGGTC GTCGTCCCGA CGGCGAAGAC CCGTTGCATG AGATGATCGC CTGTTCGATC ATTCCCACGT TGAATGTCCT GTATGATCTG CGCGAACGTG GCATGCGCCC ATACGTGTCG CTGGCGTATT CGCCAGTGCT GCTCGAACAA CTGGCGGACA TCGTCGTACA GAAACATTTT GTTATCTGGA TGGAACGCTG GCTGGCGCGC TGCGAAGCGG CGCTGCGGCG CTGGCAGCGG CAGCGGCAGC GGCATCAGGC GTATCTGGCA CGTTTTTATC TCGACTGGGG TCAGGGGATT CTTCACAGTT TTACCACGCG CTACGGGCGC AATCTGGTCG CGGCGCTGCG CGAGTTGTGT GCGACCGGAA CGGTCGAGCC ACTGGGAGGC GCCGCAACGC ATGCATATTT GCCACTGCTG TCGCGACAGG AGTCGGTGCG CGCGCAGCTC GATATCGGAA CTCTGACGGT GACCCGTCTG CTCGGTCGCC GTCCGCGCGG CGTCTGGCTG CCTGAGTGCG GTTTTCGTCC GGGATTGGAG CAGGTGCTGC GGTTGAATGG TACGCGCTAT TTCATTATCG ATCCGGCAAG CGTCGCCGTC GATGCCTGTG TCACCCACCT GCGTCCGCGA TGGGTGATGC CGCGTCGTCT GATCGCACTG CTGCGCGCGG TTGATGCGTC GCTTCAGATC GTCTCCCCCG CCATTGGGTA TGTCGGTGAT CCGTTGTATC TGGCGCCCCG TCGTGATCGG AGCACGCATC TATCCATCTG GCGCAACGGA GACAGTGATA CCGTCATCGA GCCGTATGAT CCGTATCACG CCTTTCGACG CGCTCAGGAA CACGCCATCC ATTTTGCTGA ATGGGCAGCA GCCGAATTAC GCGCTTTCGC CAATCGTCAT GATCGTCCAG GGATGTTGGT AGTTCCGCTC GATGCGGAGG TACTGGGTCG GCGCTGGTTC GAGGGTGTCG CCTGGCTGCG GACGCTCCTG GAAACTGTTC TGATCCATCG ACCGTTTGCC CTGACAACGC CTTCGCCGTA TCTGCGCGCC TTCCGACCGC GCCAGAGCAT CGTCCTGCGG GATGGATCAT GGGGTCCTGG CGGTGATCAT TCGGCATGGA ATGCGCCGGC AGGCGCTCTG CTCCGTCGTG CCCTTGATGA AACGGAGGAT CTCGTCGTTG GGGTGGTGCG GCGCTTCCCC GATGCCCGCG GCGATAGAGA ACGCGCACTC AACCAGGCAG TGCGCGAATT GTTGTTGGCG CAGGCGAGCG ATTGGTTGTT GCTCCTCGGT CGGAATGATG CCAGTGAGTC GCATCGTCCG TGGGTTCATC TGGCGCGCTG CCGGCAGTTG TGCGCGCTGG CGGAGCGCGC CTCGCTCGAT GAGGACGATC AGCAGACACT TGCCGCTATT GAAGAGATTG ACAATCCCTT CCCTCATCTC AATTATCGTA TTTTGACGGC AGAGACGGTG TGA
|
Protein sequence | MSHDGQHEEG NRSFPGALAL ILTGHMPYLR AAGRRPDGED PLHEMIACSI IPTLNVLYDL RERGMRPYVS LAYSPVLLEQ LADIVVQKHF VIWMERWLAR CEAALRRWQR QRQRHQAYLA RFYLDWGQGI LHSFTTRYGR NLVAALRELC ATGTVEPLGG AATHAYLPLL SRQESVRAQL DIGTLTVTRL LGRRPRGVWL PECGFRPGLE QVLRLNGTRY FIIDPASVAV DACVTHLRPR WVMPRRLIAL LRAVDASLQI VSPAIGYVGD PLYLAPRRDR STHLSIWRNG DSDTVIEPYD PYHAFRRAQE HAIHFAEWAA AELRAFANRH DRPGMLVVPL DAEVLGRRWF EGVAWLRTLL ETVLIHRPFA LTTPSPYLRA FRPRQSIVLR DGSWGPGGDH SAWNAPAGAL LRRALDETED LVVGVVRRFP DARGDRERAL NQAVRELLLA QASDWLLLLG RNDASESHRP WVHLARCRQL CALAERASLD EDDQQTLAAI EEIDNPFPHL NYRILTAETV
|
| |