Gene Information |
Locus tag | RoseRS_4223 |
Symbol | |
ID | 5211208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5287260 |
End bp | 5289281 |
Gene Length | 2022 bp |
Protein Length | 673 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640597812 |
Product | hypothetical protein |
Protein accession | YP_001278516 |
Protein GI | 148658311 |
COG category | |
COG ID | |
TIGRFAM ID | |
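The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 5287260
end_bp = 5289281
gene_length_bp = 2022
protein_length_aa = 673

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
translated_aa = codons - 1

print(span_bp, codons, translated_aa)
```

Under these assumptions the computed span matches the listed Gene Length, the span is a whole number of codons, and the codon count minus one matches the listed Protein Length.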
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.84974 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAGC CTGCACAGCG CCCAATCATC ACTCGTCGCC GTTTCCTTCA GGCGTGTGCG TGCACCGTCG CAGCTGGCGC GCTTGGCGCA ACCGGGTATG TTGCGGTCAA TGCGCCGCTT CCGGCGCCGC AGCCAGAGTC GGCGGCGTTG CTGGCGCAGG CTACGGGTCG CCCGCAACCA GATGCGCCGA TCCTGCTGGT CACCAATCAG CGGGCGATGC CGTCGTTTGG AGCGTATCTC GGCGCCATCT TGCGCGCTGA AGGGTTCGCC GCATTCCGTA CAGCCCGGAT CGATGATCTC GACAGCGCAT TGATCGGTCG TTTCCCGCTG GTGATCCTGG CTGCTGGCAA GTTGACCTCG CCTGAAGCTG AGATGTTCAG ATCATACGTT CTATCCGGCG GCAGGCTCAT TGCGATGCGT CCCGATCCGC AACTGGCCGA TCTTATGGGT GTACGCCATA CCGGGGGCGT GTTGGCGCGC GCGTACCTCT CCGCCGCCGA TCATCCACTG GCGCGAGGTA TTGACCGACA CGCCTTCCAG ATTCACACGC CGGTGGAGCA GTATCAACTG GCTGGCGCTG AACCGATTGC CTGGGCCAGT CAGCCTTACG GAAGCGCCAC ACGTTATCCG GCTGCAGTGG TCAACCGCGC CGGTAAAGGT ATTGCTGCTC TCTGGTCCTT CGATCTCCCA CACAATATCG CCCTGATACG GCAGGGAAAT CCCGGCGCCG CCAATCAGGA GCGCGACGGA CTCGACGGGA TTCGCACCAT GGATCTGTTC GTCGACTGGA TCGATCCGGA GGTGATAGAC GTACCGCAGG CCGATGAACT GCAGCGCCTG CTCGCCAACA TGATCCACGC CCTCACTGAC GCGATGCCGC TCCCACGAAT CTGGTACCTG CCGCAAGGGG CGACGGCGGT GCTTGTTGCG ACAGGTGACG CGCACGGGAT TCAGGTGGGA CATATTACAC CAGGGCTTGA AATTGTCGAA CGCTATGGTG GAACGATCTC GGTCTATTAC ACGACCCCCA GGGTCAGCGC CGCACGGCGC ACGGCACGGC GAATCCAGTG GTGGGCCGAA ACCCTTCCAG TCGTCGGGCA TGTTTTCGAA GAACATTCAG GCTACCCGAC GCCCCGCCTT GTTGAAAGCT GGCGCGAACA GGGCCATGGG TTCGGCCTGC ATCCATGGGT TGAGGAAGGT GTGGCATACG GCTATAACCG TGCCTGGAAC GATTTTGTGA AACATGGCTA CGGTCCCGCA GCTCCGACGG TGCGTACCCA CCGCGTGCTC TGGTCGGGTT GGGTTGATAC CGCGAAAGTG CAGGCGCAGT ACGGCATTCA TATGAGCCTC GATTATTACC ATGCCGGCCC GGCAATGCGC CGACCCGATG GTCGTTATGT CAATGGTTAC CTGACCGGCA GCGGCCTGCC GTTGCCGTTT GTCGACGAAC AGGGTCGCCT GCTGCGCGTC TACCAGCAGC ACACACATAT TGTTGATGAG CACCAGATTG CTGCATTCGA CCATGGATAT GAGATGAATC TGAGTGCGGC GGATGCTGTG GCGTTATCGC GACAACAGAT CGATCAGGCG GTCGAACGGT TTCCATCGGC GCTCGGCCTC CAGAGCCACT TCGATCCATA TGCATTTTCG CCCGAAAAGG CTGCCCTGGA ACGCGACTGG CTCGATGGTA TCCTGCATCA TGCCGCCAGT CGTGGTGTTG CCATCATGTC TGCTGAACAG TGGCTGGCGT TTACAGAGAT GCGTCATGGC GCCGCGATGC GCGACCTGAC CTGGAATGAC 
GCCGAAGGGA TGCTGACGTT CGAGGCAGAG ACGGGAGGGA CAGCGCAGCA TCATCTGCTG CTGTTACTGC CGGTGGAACA CCGGCAACGC CCACTGCGCC AGGTGATGAT CGATGGCATG CTAACGAGCG CCGTGCGCAG GCAGGTCGGT GGTGTGACCT ATGGCGCAGT GCCGCTCGCT CGCGGGCAGC GTCGGGTACG GGCGTATTAT GCCAAATCTT GA
|
Protein sequence | MKQPAQRPII TRRRFLQACA CTVAAGALGA TGYVAVNAPL PAPQPESAAL LAQATGRPQP DAPILLVTNQ RAMPSFGAYL GAILRAEGFA AFRTARIDDL DSALIGRFPL VILAAGKLTS PEAEMFRSYV LSGGRLIAMR PDPQLADLMG VRHTGGVLAR AYLSAADHPL ARGIDRHAFQ IHTPVEQYQL AGAEPIAWAS QPYGSATRYP AAVVNRAGKG IAALWSFDLP HNIALIRQGN PGAANQERDG LDGIRTMDLF VDWIDPEVID VPQADELQRL LANMIHALTD AMPLPRIWYL PQGATAVLVA TGDAHGIQVG HITPGLEIVE RYGGTISVYY TTPRVSAARR TARRIQWWAE TLPVVGHVFE EHSGYPTPRL VESWREQGHG FGLHPWVEEG VAYGYNRAWN DFVKHGYGPA APTVRTHRVL WSGWVDTAKV QAQYGIHMSL DYYHAGPAMR RPDGRYVNGY LTGSGLPLPF VDEQGRLLRV YQQHTHIVDE HQIAAFDHGY EMNLSAADAV ALSRQQIDQA VERFPSALGL QSHFDPYAFS PEKAALERDW LDGILHHAAS RGVAIMSAEQ WLAFTEMRHG AAMRDLTWND AEGMLTFEAE TGGTAQHHLL LLLPVEHRQR PLRQVMIDGM LTSAVRRQVG GVTYGAVPLA RGQRRVRAYY AKS
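The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the method used to produce this record: it builds the codon table from the conventional TCAG ordering and translates only the first 30 and last 12 bases of the gene sequence shown above. Translation table 11 assigns the same amino acids as the standard code and differs in which codons may serve as starts, so the amino acid string below covers both. The helper name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons, listed in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces.

    Trailing bases that do not fill a whole codon are dropped.
    Stop codons are rendered as '*'.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 12 bases, copied from the Gene sequence field above.
first_30_bases = "ATGAAGCAGC CTGCACAGCG CCCAATCATC"
last_12_bases = "GCCAAATCTT GA"

print(translate(first_30_bases))  # MKQPAQRPII
print(translate(last_12_bases))   # AKS*
```

The first output matches the first ten residues of the protein sequence, and the second matches its last three residues followed by the stop codon TGA.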