Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2799 |
Symbol | |
ID | 5540286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3616849 |
End bp | 3620202 |
Gene Length | 3354 bp |
Protein Length | 1117 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894926 |
Product | peptidase S41 |
Protein accession | YP_001432888 |
Protein GI | 156742759 |
COG category | [S] Function unknown |
COG ID | [COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.38727 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCCAC AAGGATACTA CCGCTGGCCA ACCATTCACA ACGATACCGT TGTCTTCGTC TGCGAAGACG ATCTCTGGAC GGTTCCGGCG TCGGGCGGCG TGGCGCGGCG GCTGACCGCC AATCCCGGCA GCGTTCAGTC GCCGGCGCTG TCGCCGGATG GCGCCCTTCT GGCATTCGTC GGACGCGACG AAGGACCCGG CGAGGTATTT GTGATGCCCG CCGAAGGCGG CGAGGCGCGT CGTCTGACGT TCCTCGGCGC AACCATGCGT GTTTGCGGCT GGGGTCGCAA TGGTCGTGAG ATACTGTTTG CCAGTTCCGC CAGCCTTCCC TTTTCACGGA TGGCATTGCT CTATGCTATT CGTGCCGACG GTGGCGAGCC TCGCCTCCTG CCGACGGGTC CGGCGGTTAC GATCTCGTAT GGTCCAACCG GAGGTGCTGT GATTGGGCGC AACGAAAGCG ACCCGGCGCG CTGGAAACGC TATCGCGGCG GGCGTACCGG CGATGTGTGG ATCGATCCAG ACGGAACCGG CGAATGGCGG CGCCTGATTG CGCTGCCCGG CAATATTGCC ATTCCCCTGT GGGTGGGCGA GCGTATCTAC TTTGTTTCCG ACCACGAAGG AGTCGGCAAT CTCTACTCCT GCCTGCCGAC GGGCGACGAC CTGCAACGCC ACACCTGGCA CCGCGAGTAC TACGCTCGTT TTCCTTCCAC CGACGGACGG CGGATTGTGT ATCACGCCGG CGCCGATCTG TATCTGTTCG ATCCCGCAAC CAACACATCG CGCAGGATCG AGGTTGAACT GCGCAGCCCG CGAACGCAAC GGAAGCGTCG TTTTGTCGAT CCGGCGCGTT TCCTTCAGAG CGTTGCGTTG CATCCCGAAG GTCATTCGCT TGCCGCAATC GTGCGTGGCA AACCGTTTAC CTTCGGCAAT TGGGAAGGCG CCGTCTTGCA GCACGGCGAT CCCGGCGCTG TGCGCTATCG CCTGGCTGAC TGGCTGCCCG ATGGCCGACG CATTGTGGTG GTCAGCGATG CAGTTGGCGA GGAGACGCTC GAAATCCATT CAGTTGCGTT CAGTAATGGC GGACAAAGAG CGGCAGCGTC AGATGCCGCT GCTGTGGATG CGGCTCTTTC CCCGTTTGAT GCACCGGTGC GCCTGGAAGG TCTCGACATT GGGCGTCCTA TGGCACTTGC GGTTTCTCCC AAAGCGCCGC TCGTCGCGGT TGCGAACAAT CGAAATGAAT TGCTGCTGGT CGATCTGACT GAACGCACCG TGCGGTTGCT CGATCGGAGT CGATATGCCT CGATGCTGGG CATTGCCTGG TCGCCGGATG GACGCTGGCT GGCGTATGGC TTTTGGGAGA CGGGGCAGAC CTCGATCATT AAGGTGTGCG AGGTTGCGAC CGGGACCATC ACGCCGGTCA CCCGACCGGT TCTGGTTGAT CGGTCGCCCG CGTTCGATCC AGAAGGCAAA TATCTTTACT TCATCTCGTC CCGCGATTTC GATCCGGTCT ATGACGACAT GCACTTCGAT CTTGGCTTCC CGCGCGGCAC GCGCCCATTC CTGGTGACGC TGCGCGCCAA CCTGCGCTCG CCCTTCGTGC CGAGACCACA TCCGCTCGAT CAGGCGACAC CCAAGCCATC GGTTGGCGAG GCGAAACCTG CCGGTGAGAC GGGCGGCGAT GCGCCGGGTG CTGAGGGAGC AGCACAAGGA ACGACAAAAT CCGAGCCGAT GATGACCATT GATCTGGAGG GAATTGCCAA TCGGATTGTG GCATTTCCGC TGCCGGTCGG ACGATACCGG CAGATAGCGG GTATCCCCGG CAAGGCGCTC TTCACGGTGT TTCCGGTCGA AAGCGCGCTC GGTCTATCCC GGATGCCGGG CGACTCGGCG GTTGCGCGCG GGCGTCTCGA TGTGTACGAT TTCGAGACGC TCAGTAGCGA AACGTTGATC GATGGCGTGT CTGCATTCTC TCTTTCCCAT GACTCAAAGA CGCTGATGTA TCGTGCCGGT AACCGGGTGC GCGTGGTCAA GGCAGGTGAA AGGCCAAAGG ACAATAGTAG CGAGCCGGGG CGGAAGAGCG GTTGGGTCGA TCTGGCGCGC ATCAAATTGA TGATTTCGCC GCCTGCTGAG TGGGTGCAGA TGTATCGCGA AGCCTGGCGG TTGCAGCGTG ACCACTTCTG GACGCCGGAT ATGTCGGGGG TGAACTGGCT CATCGTCTAC CATCGCTACC TGCCGCTGCT TGATCGAGTC GCCACACGCG GCGAGTTCTC CGATCTTCTG TGGGAGATGC AGGGAGAACT GGGCACCTCA CACACCTACG AGTATGGCGG CGATTATCGC CCGGAACCGC AGTACAGTCT GGGGAAATTG GGCGCCGATC TGCGCTATGA CGCTGAAACC GACAGTTATA TCGTCGAGCG CATTGTCGCC GGCGATGTGT GGAACGAACG CGCCAGTTCG CCGCTGGCGC GCCCCGGCGT CAATATTGCG CCAGGTGATC GCTTGATTGC GATCGGCAGT TGGCGCGTCG GGCGTGACGT CTCGCCGCAT GAGGTGCTGG TCAATCAGGC AGGGTGCGAT GTGTTGTTGA CGTTCAGGAA AGCCGATGGA ACACTCCGCG CAGTGACAGT AAAGGCGCTC CATGACGATA CGCAGGCGCG CTACCGCGAG TGGGTGGAGC GCAATCGCGC GATCGTGCAC GAGGCAACGA ATGGGCGTGT CGGGTATATT CACATTCCCG ATATGCAGGC GTTCGGGTAC GCCGAGTTCC ATCGCGGCTT CCTTGCCGAA GTCGCACGCG AGGGGTTAAT CGTCGATGTG CGGTATAATG CAGGCGGCTT TGTGTCGCCG CTGGTCGCCG AGAAACTGGC GCGCAAACGC CTGGGGTACG ATGTCTCGCG CTGGGGTGAA CCGGCGCCCT ATCCGCCCGA GTCGATTATG GGACCAATGG TGGCAATTAT CAATGAAGCG GCCGGTTCCG ATGGCGACAT CATCAGCCAT GTCTTCAAGA TGATGAAACT CGGGCCGTTG ATTGGCAAGC GCACCTGGGG CGGCGTGATC GGTATTCATC CACGTGACAC GCTCATCGAT GGCGGGGTGA CAACCCAGCC AGAGTTTTCT TTCTGGTCGG CGGAGGTTGG CTGGCAGTTG GAGAACCATG GCGTCGAGCC GGATATCGAA GTCGAGATGC GGCCACAGGA TTATGTGGCC GGCGCCGACC CGCAACTCGA CCGCGCGATA GCCGAAGTGT TGCGTCTGAT GAACGACAAT CCACCCCGGC TGCCGGAATT TGGTGAACGA CCGCGTTTGC CGTTGCCGGA AGAAGAGTGT GATGAGGAAC GATCTGGACG CTGA
|
Protein sequence | MAPQGYYRWP TIHNDTVVFV CEDDLWTVPA SGGVARRLTA NPGSVQSPAL SPDGALLAFV GRDEGPGEVF VMPAEGGEAR RLTFLGATMR VCGWGRNGRE ILFASSASLP FSRMALLYAI RADGGEPRLL PTGPAVTISY GPTGGAVIGR NESDPARWKR YRGGRTGDVW IDPDGTGEWR RLIALPGNIA IPLWVGERIY FVSDHEGVGN LYSCLPTGDD LQRHTWHREY YARFPSTDGR RIVYHAGADL YLFDPATNTS RRIEVELRSP RTQRKRRFVD PARFLQSVAL HPEGHSLAAI VRGKPFTFGN WEGAVLQHGD PGAVRYRLAD WLPDGRRIVV VSDAVGEETL EIHSVAFSNG GQRAAASDAA AVDAALSPFD APVRLEGLDI GRPMALAVSP KAPLVAVANN RNELLLVDLT ERTVRLLDRS RYASMLGIAW SPDGRWLAYG FWETGQTSII KVCEVATGTI TPVTRPVLVD RSPAFDPEGK YLYFISSRDF DPVYDDMHFD LGFPRGTRPF LVTLRANLRS PFVPRPHPLD QATPKPSVGE AKPAGETGGD APGAEGAAQG TTKSEPMMTI DLEGIANRIV AFPLPVGRYR QIAGIPGKAL FTVFPVESAL GLSRMPGDSA VARGRLDVYD FETLSSETLI DGVSAFSLSH DSKTLMYRAG NRVRVVKAGE RPKDNSSEPG RKSGWVDLAR IKLMISPPAE WVQMYREAWR LQRDHFWTPD MSGVNWLIVY HRYLPLLDRV ATRGEFSDLL WEMQGELGTS HTYEYGGDYR PEPQYSLGKL GADLRYDAET DSYIVERIVA GDVWNERASS PLARPGVNIA PGDRLIAIGS WRVGRDVSPH EVLVNQAGCD VLLTFRKADG TLRAVTVKAL HDDTQARYRE WVERNRAIVH EATNGRVGYI HIPDMQAFGY AEFHRGFLAE VAREGLIVDV RYNAGGFVSP LVAEKLARKR LGYDVSRWGE PAPYPPESIM GPMVAIINEA AGSDGDIISH VFKMMKLGPL IGKRTWGGVI GIHPRDTLID GGVTTQPEFS FWSAEVGWQL ENHGVEPDIE VEMRPQDYVA GADPQLDRAI AEVLRLMNDN PPRLPEFGER PRLPLPEEEC DEERSGR
|
| |