Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1994 |
Symbol | |
ID | 5539472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2559648 |
End bp | 2561591 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640894129 |
Product | hypothetical protein |
Protein accession | YP_001432100 |
Protein GI | 156741971 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0125565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGGAC GCGACGAGCA AGACAAACTG ATCGTCCGAT CGGTCGAGGG GAAAGGCAAG GAATTCGAGC GCATTCTTGA AGAGCGTCTG AGTCGCCGTG ATTTCCTCAA AGCCGCAGCA GTCACATCGG GACTCGTCGT TGCTGCAACC GCGATGAACG CCGATGTTGC CGCAGCGCAG ACGCGCCCGG CGCCGTTGCC GCCAAAGTTT GGCAAGGTTG CGCCGACTAC GCCGGAAGTG GACGAGATCG CTGTGCCGGA TGGCTACTAC GCCGCAACGC TCATTCGCTG GGGGGAGCCG ATCTTTGCCG ACGCGCCAGA GTTCGATGTC TGGACGCAGA CAAAGGAGAA GCAGGAGAAG CAGTTCGGCT ACAATTGCGA CTACGTTGGC TACTTCCCGC TGCCGTCGTA CACCTCGAAT AACTCGACGC GCGGGTTGCT GGTGGTGAAC CACGAGTATA CCAACCCAGA GTTGATGTTC CCCGGCTACG ATGTCGAGAA TCCCAACCCC ACCAGAACGC AGGTCGATGT CGAACTTGCT GCGCACGGTG TGTCGGTGAT CGAGGTTGCC CGCGGGCGCG ATGGGCGCTG GAATGTGGTG CGCAATTCGC CCTATAACCG CCGTCTGACC GGTTACACGC CGATGACGGT CAGCGGTCCG GCGGCCGATC ACGAGTGGAT GAAGACGAAT GCCGATCCGA CCGGGCGCAA TGTGCTTGGC ACCCTTAACA ACTGTGCTGG TGGCAAGACG CCGTGGGGCA CGGTGCTGAC CGCCGAAGAG AACTTCCACC AGTATTTTGC GAACCTGCGA GCGATGCCGA ACAGCGATTA TCGCAAGGCG ATCCATAACC GCTACGGCAT GCCGAGCGGC GCCTCTGAGC GCAGGTGGGA GAATTTCCAT GATCGCTTCG ATATTGCAAA GGAGCCAAAC GAAGGCTTCC GTTTCGGCTG GATCGTCGAG TTTGATCCGT ATAATCCCAA TTCTGTGCCG GTGAAGCGCA CCGCACTCGG TCGCTTCCGC CACGAAGCGG CAACGATCGT GATTGCGCCA TCCGGTCAGG TCGTGGCGTA CTCCGGCGAC GATGCGCGCG GTGAGTATGT CTATAAGTTC GTCTCGAACG GGCGCTACAA CCCGCGCAAC CGTGCGGCGA ACTTCAACCT GCTCGACGAT GGTACGCTCT ATGTTGCGCG CTTCAATGCG GACGGCACCG GTGAGTGGTT GCCGCTGAAG CACGGCTTTG GTCCGCTGAA TGAGGGCAAT GGTTTCATGT CGCAGGGCGA TGTGCTGATT AAGACGCGCA ACGCCGCCGA TGCGCTTGGC GCAACCAAGA TGGATCGCCC GGAGGACATC GAGACCAATC CGGTCAACAA GAAGGTCTAC ATTATCCTGA CGAATAACAG TCAGCGTGGC GTCGGCAACG GTCCGCCGGT TGATGCCGCC AATCCGCGCG CCAACAACCG CGCCGGTCAC ATTATCGAAC TGACCGAAGA GGGCAATAAC CACGCGGCGA CCCGCTTCAC CTGGAATATC TTCATTCTGG CGGGTCTGCC GACCGACGAG TCAACCTACT TTGCCGGCTA CGACAAGAGC AAGGTCAGTC CGATTGGCGC GCCGGACAAC ATTGTGTTCG ACCTGGCGGG CAACGCCTGG GTCGCTACCG ATGGCGCCGC CAGCGCCATT AAACTGAACG ACGGCCTGTT TGCCATCCCG GTTGCCGGTT CGGAGCGCGG GCATGTGCAG CAGTTCTTCT CGTCGGTGGC GGGCAGCGAG GTGTGCGGTC CGGAGTTTAC CCCCGATAAC CGGACGCTCT TCCTGGCGAT CCAGCATCCG GGTGAGGGCG GCACATTCGA TAAGCCGATC AGCACCTGGC CCGACCGCCA GGGTCTGGCG CGTCCGAGTG TCATTACCAT TCAGGCGTTC GATAACCGGC GTATCGGTCG CTAG
|
Protein sequence | MGGRDEQDKL IVRSVEGKGK EFERILEERL SRRDFLKAAA VTSGLVVAAT AMNADVAAAQ TRPAPLPPKF GKVAPTTPEV DEIAVPDGYY AATLIRWGEP IFADAPEFDV WTQTKEKQEK QFGYNCDYVG YFPLPSYTSN NSTRGLLVVN HEYTNPELMF PGYDVENPNP TRTQVDVELA AHGVSVIEVA RGRDGRWNVV RNSPYNRRLT GYTPMTVSGP AADHEWMKTN ADPTGRNVLG TLNNCAGGKT PWGTVLTAEE NFHQYFANLR AMPNSDYRKA IHNRYGMPSG ASERRWENFH DRFDIAKEPN EGFRFGWIVE FDPYNPNSVP VKRTALGRFR HEAATIVIAP SGQVVAYSGD DARGEYVYKF VSNGRYNPRN RAANFNLLDD GTLYVARFNA DGTGEWLPLK HGFGPLNEGN GFMSQGDVLI KTRNAADALG ATKMDRPEDI ETNPVNKKVY IILTNNSQRG VGNGPPVDAA NPRANNRAGH IIELTEEGNN HAATRFTWNI FILAGLPTDE STYFAGYDKS KVSPIGAPDN IVFDLAGNAW VATDGAASAI KLNDGLFAIP VAGSERGHVQ QFFSSVAGSE VCGPEFTPDN RTLFLAIQHP GEGGTFDKPI STWPDRQGLA RPSVITIQAF DNRRIGR
|
| |