Gene Information |
Locus tag | Rsph17025_4151 |
Symbol | |
ID | 5086323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 195801 |
End bp | 197108 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640485713 |
Product | transposase, IS4 family protein |
Protein accession | YP_001170307 |
Protein GI | 146280150 |
COG category | [R] General function prediction only |
COG ID | [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes |
TIGRFAM ID | [TIGR03071] HipA N-terminal domain |
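The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(195801, 197108)
print(length)                     # 1308
print(protein_length_aa(length))  # 435
```

Both results match the Gene Length and Protein Length rows of the record.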
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACACGGC GGGTCCCGGT CTGGTTCGAC AGGCTCCATC TGGCGGATAT CGAGGTCGCA GCCGATGGCG CACTCTCGCT GCGCTATGCC GAGCGCTGGT GCCTCACCGA CGGGGCGTTT CCACTCTCCG TCACCATGCC GCTGCGTGCC GAGCCTTATC CGTCCGAGGT CGTCGCGCCC TGGCTCGCGA ACCTTCTGCC GGAAGAGGAG CCGCTCCGCA TCCTGACCCG CTCGCTGGGG CTCGACCAGG CCGATGTACT GGCCCTGCTG GAACAGATCG GCGGGGATAC GGCGGGGGCG CTGTCCTTTG GCACGCCCAC GGACCGCGCC CGTTGGGCCT GGCGGTCGCT GACCGACCAC TATGGCCGCG ACGACCCGGC CGAGGCGCTC GAGCGTCATT TCGAAGATCT CGGGCGGCGA CCCTTTCTGG TGGGAGAGGA GGGGGTTCGG CAATCGCTGG CGGGCGGCCA GAAGAAATCC GCCCTTGCGG TGCTGGATGC GCAGGGAAAC CCCGTTCTGC GCCTGCCCGG ACCGGACGAT GTGCTGGCCA TTCCCCTGAA CGGTGCGCCC TCGACCCTGA TCGTGAAGCC GGACAATCCG AACCTGCCGG GCCTGACCGA GAACGAGGTC TGGTGCCTGA GACTGGCCTC TGCCATCGGC ATCCCGGCGG CGGAGGCGAC GATCCTGCGG GCCTCGGGCC GCAGTGCCAT TGCCGTCCTG CGCTATGACC GCAGACTGGG GCGACAGGGA CAATTGCAGC GCCTGCATCA GGAGGATTTC GCGCAGGCCA ATGGCCTGCC GCCGGGGCGG AAGTACGAGC GCGGCACGCG ACCCGGACTC AATCTCGCCA CCCTCCTGCG CACGGCACGG CATGTGAGCG TGACAGATGC CCTCGCGCTG CTCGACCAGG TGATCTTCAA CATCCTCGTC GCCAACACCG ACGCCCATGC CAAGAACTAT TCCCTGATCC TGCCGATTGC CGGGCCACCG CGTCTGGCAC CGCTCTATGA TGTCTCCTCC GTGCTGTCCT GGCCGCATGT CGTGCAGGCC TACGCCCAGA ACATCGCCGG AAAGAAGCGG ACATCGGAGG GGATCGCGGC GCGCCACTGG GCAGCCATCG CCAAAGAGGT CGGCTATCGC CCGCGGGACG TGCTCAACCG TGTCCAGGAC CTGATTGACA GGATCGTCGC GCATCGCGTC GGGGTGACGG AGGAGGTCGC GCGTCTGCCC GGCGCGACCG AAGGGTATGT GGCGCAGACG GCGGAGCTGG TGGACGGCAA CGCGCTTCGG ATGGCCGGGC GTCTTTAA
|
Protein sequence | MTRRVPVWFD RLHLADIEVA ADGALSLRYA ERWCLTDGAF PLSVTMPLRA EPYPSEVVAP WLANLLPEEE PLRILTRSLG LDQADVLALL EQIGGDTAGA LSFGTPTDRA RWAWRSLTDH YGRDDPAEAL ERHFEDLGRR PFLVGEEGVR QSLAGGQKKS ALAVLDAQGN PVLRLPGPDD VLAIPLNGAP STLIVKPDNP NLPGLTENEV WCLRLASAIG IPAAEATILR ASGRSAIAVL RYDRRLGRQG QLQRLHQEDF AQANGLPPGR KYERGTRPGL NLATLLRTAR HVSVTDALAL LDQVIFNILV ANTDAHAKNY SLILPIAGPP RLAPLYDVSS VLSWPHVVQA YAQNIAGKKR TSEGIAARHW AAIAKEVGYR PRDVLNRVQD LIDRIVAHRV GVTEEVARLP GATEGYVAQT AELVDGNALR MAGRL
|
| |
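The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string could be normalized and its GC content computed is given below. The helper names are hypothetical, and the rounding used by the database for its GC content field is not stated in the record, so the sketch returns an unrounded percentage.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = normalize_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small illustrative input, not the full gene sequence.
example = "GGCC AATT"
print(normalize_sequence(example))  # GGCCAATT
print(gc_percent(example))          # 50.0
```

Passing the complete Gene sequence value to `gc_percent` gives a figure to compare against the GC content row of the record.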