Gene Information |
Locus tag | Rsph17025_2970 |
Symbol | |
ID | 5085173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 3034308 |
End bp | 3036092 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640484541 |
Product | transposase, IS4 family protein |
Protein accession | YP_001169161 |
Protein GI | 146279002 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.210297 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTGCCACT CTGGTGTTGT GGCACGTGAA ATCAGGAGAA TTGCTGGAAA TAAAGCCTCC GGAGCTGCTA CTGTTGCGGC ATACACGAAG ATCACCCGCT CCGGTGGCCG CAGCTACCTG CAACTCGTCG AGGGCTTTCG GGAGGAGGAG GGAAAGGTTC GCCATCGCGT CGTGGCCAAC CTCGGCCGGC TCGAGGAACT GACGCCGGCG AAGCTCGATC CGTTGATCAA CGGCCTGAAC CGCGTCCTCG GGCGCGCCGA GAACACCGGC TTCGAGATCG CTCAGGAGAG CGCCCGCGCC TATGGCGACG TCTTCGCGCT GCACGAGTTG TGGAAGGATC TCGGCTTTGA CCGCGCGCTG GCCGCAGGGA TGCGATCCGG GCGCCGCAAG ACCGACGTCG ACGCCCTGGT GCGCGCCATG GTCTTCAACC GGCTTTGCGC CCCCGACAGC AAGCTCGGCT GCCTGCGCTG GCTGGAGACG GTGGCCATGC CGTCCATGCC CGAGACGGTC ACGCATCAGC ACCTGCTGCG GGCGATGGAC GCGCTGATGG ATCACGCCGA GCGGGTCGAG ATCGAACTGG CCCGCCAGAT CCGGCCGCTC GTTGATCGCG ATCTGGCGGT GGTCTTCTAT GACCTGACCA CCGTGCGCAT CCACGGCGAG GCGGAGGTGG ACGACGACCT CCGCGCCTTC GGGATGAACA AGGGTGAGCC GCAGGCGAAC GGAGGCATCG CCCGCCAGTT CGTCCTGGGC GTCGTCCAGA CGGCCGGCCT GCCGCTCATG CACACCGTCC ACCCCGGCAA TGTGGCCGAG ACGAAGACGC TGCAGGCCAT GCTGACCACC GTCCTGCAGC GTTTCCCGGT GCAGCGGGTG ATCCTCGTCG CCGACCGAGG GCTGCTGAGC CTCGACAATA TCGATACCCT CACCACGCTC GCCGATCAGG GCGGCCGCAA GCTCGAGTTC ATCCTGGCCG TCCCCGCCCG CCGCTACGGC GAGCTGGTCG AGACCTTCCG GGGCCTCGCG TTCGACGAGG CGGGCCTGGC CGAGGCCGGC TTCGCGGGCC ACCGCCTGAT CGTGGCGCAC GATCCGCTGC GCGCGGCCGA GCAGTCCGAG AAGCGCCGCG CCCGCATCGC CGAACTGGAA ACCCTGGCCG AGCGCATGGT CGGCAAGCTC GACGCCCAGG AGTTCGGACC GGACCGCGCA GCGGCGGATC CTCCAGTGGA GGATCCGGGG CCGGACGGGG CGACCGACCG AGGCCGCCGG GCCTCCGACC GCGGTGCCTA CAGCCGGTTC ACCCGCGCGG TCGCCGAGGC AGAGCTGACC CGCTTCCTCC AGGCCGACTT CACGGCCGAC CGCTTCAGCT GGTCGCTGGA CGAGGCCGCC ATTGCCGAGG CCGAACTCTT CGACGGCAAG CTCGCGCTGC TGACCAATGC GCCCGACCTG ACGCCAGTTG ACGCGGTCGT GCGCTACAAG GCGCTGGCCG ACATCGAGCG CGGCTTCCGG GTGCTGAAGT CCGACATCGA GATCGCCCCG GTCCACCACC GCCTGCCCGA GCGCATCCGC GCCCACGCGC TGATCTGCTT CCTCGCGCTC ACGCTCTATC GCGTCATGCG CATGCGGCTG AAGGCGAAGG GGCACGCGGC GAGCCCCCGC ACCGCGCTCG ATCTGCTGGC GCGCATCCAG AAACACAGAA CCCATATCGG CCCCGGAACT TTCGAAGGAG TCTCTCGTCC CGAGCCCCGG CAACTCGAAC TCTTCTCTGC CCTCAGTCTT CCAAAGCCCG TCTGA
Protein sequence | MCHSGVVARE IRRIAGNKAS GAATVAAYTK ITRSGGRSYL QLVEGFREEE GKVRHRVVAN LGRLEELTPA KLDPLINGLN RVLGRAENTG FEIAQESARA YGDVFALHEL WKDLGFDRAL AAGMRSGRRK TDVDALVRAM VFNRLCAPDS KLGCLRWLET VAMPSMPETV THQHLLRAMD ALMDHAERVE IELARQIRPL VDRDLAVVFY DLTTVRIHGE AEVDDDLRAF GMNKGEPQAN GGIARQFVLG VVQTAGLPLM HTVHPGNVAE TKTLQAMLTT VLQRFPVQRV ILVADRGLLS LDNIDTLTTL ADQGGRKLEF ILAVPARRYG ELVETFRGLA FDEAGLAEAG FAGHRLIVAH DPLRAAEQSE KRRARIAELE TLAERMVGKL DAQEFGPDRA AADPPVEDPG PDGATDRGRR ASDRGAYSRF TRAVAEAELT RFLQADFTAD RFSWSLDEAA IAEAELFDGK LALLTNAPDL TPVDAVVRYK ALADIERGFR VLKSDIEIAP VHHRLPERIR AHALICFLAL TLYRVMRMRL KAKGHAASPR TALDLLARIQ KHRTHIGPGT FEGVSRPEPR QLELFSALSL PKPV
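The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene sequence is a stop codon (the listed sequence ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 3034308, 3036092

length = gene_length(start_bp, end_bp)
print(length)                  # 1785, matches "Gene Length"
print(protein_length(length))  # 594, matches "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.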