Gene Information |
Locus tag | Rcas_2950 |
Symbol | |
ID | 5540440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3826018 |
End bp | 3829002 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640895070 |
Product | hypothetical protein |
Protein accession | YP_001433029 |
Protein GI | 156742900 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
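
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Rcas_2950
start_bp, end_bp = 3826018, 3829002

gene_len = gene_length_bp(start_bp, end_bp)
protein_len = protein_length_aa(gene_len)

print(gene_len, "bp")      # 2985 bp, matching the Gene Length field
print(protein_len, "aa")   # 994 aa, matching the Protein Length field
```

The Strand field does not affect this check: on the minus strand the coordinates still span the same number of bases.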
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00290707 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATCGCA CTCCGCCCTT GTTCGTGTCG GTGTACGCAC TCCTTATCTG CGCGCTCTTC GCCACCAATT GGAGCCTCGC CGCTTCGCCT TCGGAACATC GCCCGTCGGT ATCGATGACA TTTGTATCCG GCGATACGCC TGTGCGGCTG GATGCACAGG TCGAGCCTGA CGCTACGGTT GTCTGGCGGA ACAATGACAC CAGACCGCAC CGACTGCGAT CGGTCGATGG CAGTTGGACA TCACCGGTGA TTGCGCCGGG GGATGTGCAG CGTCAGTGGT TTGCGCAGCC TGGACGTTAC CCGTTTGTGT GCGACTTCGA TCAGGATATG CGCGGTGAAC TGACGGTTCA GTCCGGCGCT TACCGCGTGT TTCTGCCGCT GGTGGCGCGT GGCATGCCCG CAGGACCATC CGGTGAGCGT TGGTCGAACC CGGCGACGTG GGGTGGGCGC CTGCCGCAGG CGGGTGAGGC AGTGCGCATT CCGCCGGGCA AAACCGTTCT CCTGGATGTC AGTCCGCCGC CTCTGCGCAG CCTGACGATT GAGGGAAGCC TGGTGTTCGA CAACCGCGAT CTCGACCTGA GCGCCGGCTG GATTATGCTG CATGGCAACG GGCGATTGCG CATTGGCGAT CCGGCGGCGC CGTTCCGCCA TCGCGCGACC ATTACGCTGA CGGCGCCCGA TCCTGATGAG GACGTGATGG GCATGGGAAC GCGCGGTATT CTGTTGATGG GTGGAACGTT CGAAGCCTAT GGCCTCGCGC CGGTTCCGGT CTGGACGACT CTTGCCGATC ACGCCGACGC CGGAGCGACT CGGCTCACCC TGCGCGATGC CGTCAACTGG CAGGCTGGCG ATCAGATTGT GGTTGCGCCG ACCGACTTCT ACGGCGTGGC TGAGACGGAG CAGTTGACCG TTCAGACAGC GGACGGTCCT CAGGTCGATC TGTCCACACC GCTGCGACAA GAGCGATGGG GACGGTTGCA GTACGTGAGT GCGGCAGGAA TGATCCTGAC GCCAACGACG GACATCACCC CGCTTGCGCT CGATGAACGC GCAGAGGTGG GCAATCTATC GCGCCAGATC GTCATTCAGG GAGCGGACGA TGCGCGCTGG CGCAATGAGC GCTTTGGCGC GCACATCATG GTGATGAACC ATGCCGTGCT GCGTCTCGAC GGCGTCGAGT TGCGGCGCAT GGGGCAGGGT GGGCGTTTGG GACGCTATCC GATCCATTTC CATCTGCTCT CGTATGCGCC CAATGGCGCA CTGATCGGAG ACGCAACAGC GCAGACAGTG ACCAACTCGA GCATCTGGAA CTCCGCCAAT CGCTGTATTG TTATTCACGG CACGAATGGC GTAACGGTGC GGAACAATAT TTGCTACGAC ATCGCCGGGC ACGCAATCTT TCTGGAAGAC GCGGTCGAGC GCCGCAATCT GATAGAGCAC AATCTGGTGT TGAAAGTACG TCAACCGCCA CAACCGCTGT TGCCAAGCGA CCGGATAATG TTTCGCCGTG GACCGTCGGG TTTCTGGTTG ACGAACCCGG ATAACACCGT GCGCGGCAAT GTCGCTGCTG ATGCTGCCGG TAATGGCTTC TGGCTGGCGT TCCCCGAAAA ACCGCTCGGA GATAACCAGA ATGTTCCGAT TCGCCCGATA AATACCCTTT TGGGCATTTT TAGTCATAAT GTCGCACATT CGAATAATCG CCCAGGAATC AATATCGACT TTGCGCCGAT TGATAATGCG GGCAATACTG CCGAAACAAA ATACATCCCT ACCATTGATG GAGGACCTTT CAGGTATGAA 
AACCGCATAC GATTCACGCT GAGCGATATT ACGACCTACA AGAACAATGA CAACGGATTG TGGAACCGGG TTTCGTGGCC GAACTACGTT CGGTTTGTTT CAGCCGATAA TGCCGGCATG TTCTTTGCCG GCGCCGGCGA CGATGGCAGA ATCATCGACT CGCTGATCAT TGGTACGAGC CTGAATAATC GCAATCCGTC ACCAACGTCA TTCTCCGGCG ATCAACCGAA TACTGCGATT GCCAGTTATC ATAGCACATT CGACATTTAT AACAATGTCG TCGTCAATTT CCCGCTGAGT CCGAGAAATG ACCGCGCGAG CGGCGCGTTT GCCACCAACG ACTACTATAC CCGACCGGTT GATCGCGGTC TGGTGCGCAA TCCCAATAAT CGGCTGATCA ACTCCCATCC GGGGCGTCGG GTCATTTCGC CCAATCTGAA CACGCCGGTC GGCAACGCTG CGCTCGCCGG CGCGCTTTGG GATCCGTATG GCTACTGGGG ACCGGCAGGT AACTATTGGG TGTACGATGT GCCATTTCTG ACGGCCGGTC GCCCCTGTGC GCCGGTCGCA CCGGCCGGGC AGAACGGGCA GAGTTGTGTT GGACCGTATT ATGGCGTCGC TGGTTTTCGT ATCGATGGAG GCGACCCGTT TAAGCCACGT ATGCCACTGA CCGTTACCCG GCTCGATGAT GCGCTTCAAC CCATGGCACA ATGGATCGTG GAGGAGGGGA GTGGCAGCGG ACGGAACACC TTCGGGATTA TGCCGTGGAT GCGGCATTTT GCAGCCGTTC CTGGAGGACG CTATCTGATC GAGTTCCGTG ATGCTGCCAA CAGCCTGCCG CCGCCATCGC ACGAAGTAAA ACTGGAGTTG ACAAATATGC ACTCTCCTGC CGATCAGGTC ATCCTTGCCG TTCCGTTCAG TGGCAGCGTG ACGGCGCGCG CCTACCTGAC GACGCGGCAA TCCTATGAGT ACGCTGCGCC GGGATCGCCT GATCGCCGCG ATCTGACGCC GGTCTCATCG ATGGCGGCGC TGCTGGCGAC GGATAACAGT ATGTGGCAAG ATACGGTCAA TCAGCGTGTG TGGGTTCACG TTCGCGGTGG GGTTCCATGG TCGGGAGGAG AGCCGACGAA TCCGCTCTCG GACGCGGCGC TCTATCGGGA TACAATCCTG CGTATCGTTC GCTGA
|
Protein sequence | MHRTPPLFVS VYALLICALF ATNWSLAASP SEHRPSVSMT FVSGDTPVRL DAQVEPDATV VWRNNDTRPH RLRSVDGSWT SPVIAPGDVQ RQWFAQPGRY PFVCDFDQDM RGELTVQSGA YRVFLPLVAR GMPAGPSGER WSNPATWGGR LPQAGEAVRI PPGKTVLLDV SPPPLRSLTI EGSLVFDNRD LDLSAGWIML HGNGRLRIGD PAAPFRHRAT ITLTAPDPDE DVMGMGTRGI LLMGGTFEAY GLAPVPVWTT LADHADAGAT RLTLRDAVNW QAGDQIVVAP TDFYGVAETE QLTVQTADGP QVDLSTPLRQ ERWGRLQYVS AAGMILTPTT DITPLALDER AEVGNLSRQI VIQGADDARW RNERFGAHIM VMNHAVLRLD GVELRRMGQG GRLGRYPIHF HLLSYAPNGA LIGDATAQTV TNSSIWNSAN RCIVIHGTNG VTVRNNICYD IAGHAIFLED AVERRNLIEH NLVLKVRQPP QPLLPSDRIM FRRGPSGFWL TNPDNTVRGN VAADAAGNGF WLAFPEKPLG DNQNVPIRPI NTLLGIFSHN VAHSNNRPGI NIDFAPIDNA GNTAETKYIP TIDGGPFRYE NRIRFTLSDI TTYKNNDNGL WNRVSWPNYV RFVSADNAGM FFAGAGDDGR IIDSLIIGTS LNNRNPSPTS FSGDQPNTAI ASYHSTFDIY NNVVVNFPLS PRNDRASGAF ATNDYYTRPV DRGLVRNPNN RLINSHPGRR VISPNLNTPV GNAALAGALW DPYGYWGPAG NYWVYDVPFL TAGRPCAPVA PAGQNGQSCV GPYYGVAGFR IDGGDPFKPR MPLTVTRLDD ALQPMAQWIV EEGSGSGRNT FGIMPWMRHF AAVPGGRYLI EFRDAANSLP PPSHEVKLEL TNMHSPADQV ILAVPFSGSV TARAYLTTRQ SYEYAAPGSP DRRDLTPVSS MAALLATDNS MWQDTVNQRV WVHVRGGVPW SGGEPTNPLS DAALYRDTIL RIVR
|
| |
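
As a quick consistency check between the two sequence fields, the start of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below is a minimal, standard-library-only illustration. It assumes that the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares for all 64 codons. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino acid string
# (translation table 11 uses the same codon-to-amino-acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring spaces; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence and first 10 residues of the protein
gene_prefix = "ATGCATCGCA CTCCGCCCTT GTTCGTGTCG"
protein_prefix = "MHRTPPLFVS"

print(translate(gene_prefix))                    # MHRTPPLFVS
print(translate(gene_prefix) == protein_prefix)  # True
```

The same function applied to the last codon of the gene sequence, TGA, yields the stop marker, which is why the protein is one residue shorter than the codon count.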