Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4047 |
Symbol | |
ID | 5541558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5250349 |
End bp | 5252997 |
Gene Length | 2649 bp |
Protein Length | 882 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640896160 |
Product | hypothetical protein |
Protein accession | YP_001434098 |
Protein GI | 156743969 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.353342 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0213567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACAC GTTTTGCCCT CGTTCTTGCA CTGGTCATCG GTAGTGTGTG CGGTGCGGCG GCTTCTGCCG CTGCCGGTGC GATTGCCGTT GTTCGTGGTG GTGACGAAAC GCTGGTTGTC CAGGCGACAA CGAGCGGAAC GCGGATCATC TGGCGTCCTC CGGCGTCCGA TCCGTCTCTG CATGTCGAAC CGGTTCTGGT CGCGTTGCGC CTCACCGGTG ATGCGACTAT CGCGCCTCGC CTGCTGGCGC TCGATGATAC GCCGTGGACG GGCGACTTCG ACGATCCGCC AGGTGCGCCG GTATTTGTGC TGCGCGAGGC GCGTCAGCGC GGTGAGCGTC TGGCTGTGCT GGCACTCAGT CCGGTCTATC TGCGCGAGGG GCAGGCGCGC GCGGTGCGCA CGCTCGAAGC GCTGGTCGAG GGCGCGGCGC CGCTGAGTGA CTCACCTGTT CCCGTCGCTG CATCATCACT TGCAGCAGGA GAGCCCGCTT CTGGGCGTCC GCCAGCGTTT CCGGCGCCTC CGGCGCTGCG GGTGCGGGTG GACCGTGCAG GTGTGCAGGT CATCCCCGTC AGTTCTCTGA GTTCGGCGAT CGCCGGAGCG CCGGAGCGCC TCAAACTGAC CCGCGCCGGG GTGGAAATCC CGCTCGAACT GCGCGACGCG AGCGGTAATG GCGTCTGGGG CGACCCAGAC GACGAACTAC GCTTTTATGC GCCGCCGCCG GGTAACCGCT GGAACCGCAG CGATACGTAT TGGATCACGC TCGAAGCAGG ATCGCGATTG CGCATTGCGT CCCGCGCGGT GAGTGCGCCA TCGGGCGAAG CGCCATCCAC TGCGCTGGAA CGGGGTGTTG TGCGCGGAAC GGCCTACTAC GACTCGCGGC GCCCTGGCAG TGATGGCGAT CACTGGTTTG CGAAGCTGCT GCGCGCTGAG GCGGGACAAC CGGCGGATGA CCAGGCGATG CTGTCCGTTC CGCTCACGAC GACCCTTCCA ACCGCAACTG GAACGGTGAC GCTGACGGTT GCGGTCCACG CGCAATCGGA TGGTGCGCGT CGCCTGACCG CCGCCATCGA GTCGAGCAGC GGATCGCCGG TTGAGTGGAG CGGCAGCGGA GATGCACTCC TGACGCTCAG TGTAGCCGGT AGCCCTGCTG CCACGACGCA AGTGCGCCTG ACGCTGACCG CTGTCGTCGG GTATGCACAG GTGGCCGTGG ATACGGTCGA ATGGATGCGA CCGGTGCAGT TGCAGTTCGG CGGGAAAGGC GCTGTCTTTC AGGGCGCGCC GGATCAGCGC GCCTACTGGT TGACTGGTGC GCCTTCCGGG TTCGATCTCT ACGATATTAC CGATCCTGTG ATGCCGACGC GGTTACAGAT GCCCGCCGGT TCCGCATTCG AGGATAGTGC GCCGGGGAAA TTGTATCTGC TGACCGGCGT CGGCACACGA CACACGCCGA CGGTCGAGCC GTTCACGCCG CCAACGCTGC CAACCGATGC CAGTGTGCTG TACATCGCTC CCGCGCCGTT CCATGCTGCG CTGACGCCGC TCGTGGACCT GCGGCGCGCG CAGGGGTACA GCGTGGCGGT TGTGGATGTG CAGCACCTCT ACGACGGATG GAGTGACGGT CAGGTCGATC CTGATGCGAT CCGCGCCTTT CTGCAATTTG CGCGTCCCCA GGCAGTGACG CTGGTTGGCG ACGGGAGTTC TGATCCGTTC GACTATACCG ACCGTGGTGC GAAGAATGTC AACCTGATCC CGCCATATCT GGCGATGGTC GATCCGTGGC TGGGCGAAAC CGCCTGCGAA ACGTGTTACG CGCAACTGGA CGGCGAGCGA CCGACCGATG ACCGGCTGCC GGATGTCTGG CTTGGGCGGC TGCCGGCAAA GAGCGTTGCT GAAGTGCAGT TGCTGGTGGC CAAGATCATC AGGTACGAAA CGTCTCCATC CGGCGGCGCA TGGCGCAGCC GCGCGCTCTA CCTGGCGGAT GATGCGGACA CCAGCGGCGA TTTTGTGGCT CAGGCGGAAG CGAGCATTGC GCTGCACCCG GTAGGCGTCC AGATCGGTCG GGTGTTTTTT GGCAACGGTG CGGGAGCGTT TCCAACCGCT GCCGCAGCGC GGACTGCTAC GCGGACACAG TTCGACAACG GCGCGGCGGC GGTGGTCTAC ATCGGGCACG CGCATCAACA GCAGTGGGCG GTGACGGAGT TGAGCGCGCC GGAGAACTGG CTACTCCATC GAAATGATGT CGCGGCGCTG ACCAATGGCG AGCGCCTGCC AGTGGTACTG GCGCTCACCT GCCTGAGCAG CGCCTTTCAG TGGCCCTCAT ACGTGGGCAT GACTGTTGAT GAGGCGCTGC TGCTGCACGA GAAGGGCGGC GCTGTGGCAG TCTGGGGACC GACGGGGCTG GGCGTGTCCT ACGGGCACGA CAAACTGCAA CGGGGATTCT TCCGCGCCCT CTGGTCACCC GCGCCAGATG TGGGGATTGA ACGCGCCGTG CCGCTTGGCG CGTTGACCAG CGCCGGGTTC CGTGACCTCT TCACCGGAAG CGCCTGTTGT CAGGAAACGA TATTCACCTA TGCGCTGCTG GGCGACCCGC TCACGCCGCT GCGGATGACG GCGGGGACGC GTGTGATGTT GCCGCTGGTG CAGCGGTAG
|
Protein sequence | MMTRFALVLA LVIGSVCGAA ASAAAGAIAV VRGGDETLVV QATTSGTRII WRPPASDPSL HVEPVLVALR LTGDATIAPR LLALDDTPWT GDFDDPPGAP VFVLREARQR GERLAVLALS PVYLREGQAR AVRTLEALVE GAAPLSDSPV PVAASSLAAG EPASGRPPAF PAPPALRVRV DRAGVQVIPV SSLSSAIAGA PERLKLTRAG VEIPLELRDA SGNGVWGDPD DELRFYAPPP GNRWNRSDTY WITLEAGSRL RIASRAVSAP SGEAPSTALE RGVVRGTAYY DSRRPGSDGD HWFAKLLRAE AGQPADDQAM LSVPLTTTLP TATGTVTLTV AVHAQSDGAR RLTAAIESSS GSPVEWSGSG DALLTLSVAG SPAATTQVRL TLTAVVGYAQ VAVDTVEWMR PVQLQFGGKG AVFQGAPDQR AYWLTGAPSG FDLYDITDPV MPTRLQMPAG SAFEDSAPGK LYLLTGVGTR HTPTVEPFTP PTLPTDASVL YIAPAPFHAA LTPLVDLRRA QGYSVAVVDV QHLYDGWSDG QVDPDAIRAF LQFARPQAVT LVGDGSSDPF DYTDRGAKNV NLIPPYLAMV DPWLGETACE TCYAQLDGER PTDDRLPDVW LGRLPAKSVA EVQLLVAKII RYETSPSGGA WRSRALYLAD DADTSGDFVA QAEASIALHP VGVQIGRVFF GNGAGAFPTA AAARTATRTQ FDNGAAAVVY IGHAHQQQWA VTELSAPENW LLHRNDVAAL TNGERLPVVL ALTCLSSAFQ WPSYVGMTVD EALLLHEKGG AVAVWGPTGL GVSYGHDKLQ RGFFRALWSP APDVGIERAV PLGALTSAGF RDLFTGSACC QETIFTYALL GDPLTPLRMT AGTRVMLPLV QR
|
| |