Gene Information |
Locus tag | Rsph17025_2101 |
Symbol | |
ID | 5083560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 2140683 |
End bp | 2142314 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640483664 |
Product | hypothetical protein |
Protein accession | YP_001168297 |
Protein GI | 146278138 |
COG category | [S] Function unknown |
COG ID | [COG4383] Mu-like prophage protein gp29 |
TIGRFAM ID | |
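The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both are assumptions about this record's conventions, though they agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2140683, 2142314)
print(length)                     # 1632
print(protein_length_aa(length))  # 543
```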
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0242078 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAGA CACCCGTCCT TCTCGACCGC TGGGGCAAGC CCGTGAAGCG CGCGGTCCTG ACAGAGGAGA TCTCGGCCGC CACCCTGGGC AGCGTGCGCA GCCCGATCAC CGGCTATCCG GCCGACGGGC TGAACCCGGT GCGGCTGGCC TCGATCCTGC GCGAGGCCGA CGCGGGCGAC CCGGTGCGCT ATCTCGAACT GGCCGAGACG ATCGAGGAGC GCGACCTGCA CTACCTCGGG GTCCTTGGCA CCCGGCGCCG ATCGGTCAGC CAGCTCGACA TCACGGTCGA GGCGGCCTCG GACGATCCGC GCGACGTGGA GATCGCCGAC ATGATCCGCG ACTGGCTCAC GCGCGACGAG CTGTCCGACG AGCTCTTCCA CATGCTCGAC TGCATCGGGA AGGGCTACAG CTTCACCGAG ATCATCTGGG ACACCTCCGA AGGCCAGTGG CGCCCGGCGC GTCTGGAGTG GCGGGACCCG CGCTGGTTCC GCTTCGACCG GGCCGCTCTG ACCACGCCGC TGATGCTCGG CCCGCACGGC GAGGAGCTGG AACTGACGCC GTTCAAGTTC ATCTTCGCCG AGGTGAAGGC CAAGTCGGGG ATCGCGTTGC GGTCGGGTCT GGCGCGGGCC GCGGCCTGGG CGTGGATGTT CAAGGCGTTC ACCCAGCGCG ACTGGGCGAT CTTCACCCAG ACCTACGGCC AGCCGCTGCG CCTGGGCAGA TACGGCCCCG GCGCGTCCGA AGACGACAAG GCCACGCTCT TCCGGGCGGT GGCCAACATC GCCGGCGATT GCGCGGCGAT CATCCCCGAG TCAATGGCGA TCGACTTCGT CGAGACGAAG TCCGTGGGCG CCACGGCCGA TCTCTACAAG CAGCGGGCCG ACTGGCTCGA CCAGCAGATC TCGAAGGCGG TGCTGGGCCA GACCGCCACG ACCGATGCCG TGACCGGGGG GCTGGGGTCC GGGAAGGAGC ACCGGCAGGT GCAGGAGGAC ATCGAGCGCG CCGATGCGAA GGCGCTCTCG GGCATCCTGA ACCGCGACCT GATCCGGCCC TGGGTGGATC TGGAATACGG GCCCCAGGCG CGCTATCCTC GGCTCAAGAT CGCGCGGCCG GAGCCCGAGG ATCTGAAGGC GATGGCCGAG GCGCTCGCAG CCCTCGTGCC GATCGGCCTC AGGGTCAGCC AGAAGAAGAC CCGTGACCGT TTCGGCTTCG ACGAACCCGA AAACGACGCC GATGTGATGG GAGGAACGCC CGCCGCCGCA GCCGTCGCGG CACCCCCGGG CGCGGATCGG CCGATTAAAC GGTTTTCCGG CGTTTTTAAA GGGGGCGAGC CCCCGGCGCG ACCCGAGACA GCCCTGCAGG CGGAAGCGGC TCCAGCGGCC CTCCCAGCGA GTGACGATCC GGCGGCGCTG CTGGCGGATC GGCTGGCGGC CGACGCGGCG CCGGCCATGG GCGCGATGAT CGAGCGGGTC GAGACGATGC TGGCGGCCGC GGGTTCGCTG GCCGAGTTCC GCGAGATGCT GCTCGCGGGC TTTCCCGGGA TCGACGCGGG CGACCTGGCC ACCCTGATGG CGCAGGCGAT GATGGCCGCT CATGCCGGGG GTCGTGCGGC GGCGGAGGAT GCCGGTGCCT GA
|
Protein sequence | MAKTPVLLDR WGKPVKRAVL TEEISAATLG SVRSPITGYP ADGLNPVRLA SILREADAGD PVRYLELAET IEERDLHYLG VLGTRRRSVS QLDITVEAAS DDPRDVEIAD MIRDWLTRDE LSDELFHMLD CIGKGYSFTE IIWDTSEGQW RPARLEWRDP RWFRFDRAAL TTPLMLGPHG EELELTPFKF IFAEVKAKSG IALRSGLARA AAWAWMFKAF TQRDWAIFTQ TYGQPLRLGR YGPGASEDDK ATLFRAVANI AGDCAAIIPE SMAIDFVETK SVGATADLYK QRADWLDQQI SKAVLGQTAT TDAVTGGLGS GKEHRQVQED IERADAKALS GILNRDLIRP WVDLEYGPQA RYPRLKIARP EPEDLKAMAE ALAALVPIGL RVSQKKTRDR FGFDEPENDA DVMGGTPAAA AVAAPPGADR PIKRFSGVFK GGEPPARPET ALQAEAAPAA LPASDDPAAL LADRLAADAA PAMGAMIERV ETMLAAAGSL AEFREMLLAG FPGIDAGDLA TLMAQAMMAA HAGGRAAAED AGA
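The gene and protein sequences above are related by translation table 11. The following is a minimal sketch of that check using only the Python standard library. It assumes the gene sequence is listed 5' to 3' in the coding orientation (it begins with ATG and ends with TGA, so no reverse complement is applied) and that the spaces are display grouping only. Table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. `translate` and `gc_fraction` are illustrative helpers, not part of any database API.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; the amino-acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base display grouping and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
first_30 = "ATGGCCAAGA CACCCGTCCT TCTCGACCGC"
print(translate(first_30))  # MAKTPVLLDR
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and the result of `gc_fraction` can be compared against the GC content field.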