Gene Information |
Locus tag | RoseRS_0444 |
Symbol | |
ID | 5207380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 563079 |
End bp | 564197 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640594065 |
Product | transposase, IS4 family protein |
Protein accession | YP_001274820 |
Protein GI | 148654615 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
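The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 563079
END_BP = 564197


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Length in aa, assuming an unspliced CDS whose length is a multiple of 3."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and is not translated.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1119 bp) and Protein Length (372 aa) fields.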
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000119461 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0439577 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGACA CGTACCGCCG GTATCGTGCC ATAGCTCAGT GTTTGCTGCA ACTCTATCCC CAGGTCGGTG GGCATCAACG GCGCCATCTG GCGACCTTGG CGCTCTTGAT CTGCGGGATT GTCGGCAGCC AACACACCCA ATTGCCAAAA GTGGTTGAAC GGACGCCTGG CGGACGCGCC GCCGACGAGA GTGTCGTGAT GCGTTTTCGA CGCTGGCTCA AACACGACAA CGTAACCTAC AAGCGCTGGA TGCTGCCCGT TGCCCAAGCA CTTATCGCCA TGTTGGGGCG TCGACCATTG GTGTTCGTCA TTGATGGGAG TACCGTTGGG CGGGGATGCA TGTGCCTGAT GATCAGCGTG TTGTATCAGC GTCGGGCGCT TCCGATCACC TGGCTCGTGG TGAAAGCGCG CAAAGGCCAT CTGCCAGAAG CACTGCATTG TGCGCTGCTC GAGCAACTCG CTCAGCTCGT TCCGGCCGAG GCGAGCGTGA CGATCTTGGG GGATGGTGAA TATGATGGCG CCGATTGGCA AGCCGCGATT ACTGCGCGCG GGTGGAAGTA TGTCTGCCGA ACCGCAAGCA ATATCCTGCT GACGCTGGCG GAGGCGACTA TTGCTCTTGG CGATCTCGCG CCGAAGCGTG GCGAGGTTAT CGCCGTCGAG CAGGTCTGCA TAACGGCCGC ACAGTACGGT CCGGTTAACG TGCTGGCGGT GTGGGAAGCG GCCTACGAGC ATCCAATCCA TCTGGTGACG ACGCACGCTG ACGTGGCGTA TGCCTTGGCC TTGTATCGCC GCCGTGCGCA GATCGAAACC TACTTCTCGG ATCAGAAGAG TCGCGGCTTT CGGATCAACC GTAGCCATAT CAGTGATCCG ACACGACTTG CGCGCCTGTT GATCGCGACC GCGCTGGCGT ATCTGTGGGT CGTCTATCTG GGCGTGGTGG CGAGACGGGA TGCGCTGCGT GGGCGCATCC ATCGACCGGA TCGCTGCGAT CTCAGCTTGT TCTCGCTTGG CTTGCGGCTG CTAGCCTACT GTCTGCGCCA TCGACGAACC ATCCCGCGCG GGTTGCCCAA ACCACTCTTC ACGGCATGTC AAACCGCGTT CATATGTTCT GTACGGTAG
|
Protein sequence | MRDTYRRYRA IAQCLLQLYP QVGGHQRRHL ATLALLICGI VGSQHTQLPK VVERTPGGRA ADESVVMRFR RWLKHDNVTY KRWMLPVAQA LIAMLGRRPL VFVIDGSTVG RGCMCLMISV LYQRRALPIT WLVVKARKGH LPEALHCALL EQLAQLVPAE ASVTILGDGE YDGADWQAAI TARGWKYVCR TASNILLTLA EATIALGDLA PKRGEVIAVE QVCITAAQYG PVNVLAVWEA AYEHPIHLVT THADVAYALA LYRRRAQIET YFSDQKSRGF RINRSHISDP TRLARLLIAT ALAYLWVVYL GVVARRDALR GRIHRPDRCD LSLFSLGLRL LAYCLRHRRT IPRGLPKPLF TACQTAFICS VR
|
| |
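As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates a DNA string codon by codon and computes a GC fraction. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG, although the record places the gene on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for all 64 codons (the two tables differ only in permitted start codons). The helper names are hypothetical, and the 10-letter spacing of the record is stripped before processing.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used for 10-letter grouping and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence from the record.
    print(translate("ATGCGCGACA CGTACCGCCG GTATCGTGCC"))
```

Applied to the first three 10-base groups of the gene sequence, this yields MRDTYRRYRA, matching the first group of the protein sequence. Pasting the full gene sequence into `gc_fraction` would give a value to compare against the GC content field.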