Gene Information |
Locus tag | Rsph17029_1144 |
Symbol | |
ID | 4895429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1183350 |
End bp | 1187240 |
Gene Length | 3891 bp |
Protein Length | 1296 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640111730 |
Product | putative phage host specificity protein |
Protein accession | YP_001043026 |
Protein GI | 126461912 |
COG category | |
COG ID | |
TIGRFAM ID | |
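The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1183350, 1187240)
print(length)                     # 3891
print(protein_length_aa(length))  # 1296
```

Both results agree with the Gene Length (3891 bp) and Protein Length (1296 aa) reported in the record.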
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.383644 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGC TTCTGCTGGC CGCGGCCGGA TCGGCGATCG GAGCAGGCTT CAGCGGCACG GTGCTCGGGC TGACAGGGGC GGTCATCGGG CGCGCGATCG GGGCGACGGT CGGCCAGCTG ATCGACCAGC GGCTGATGGG CGCGGGATCG CAGACGGTCC GCACCGGGCG GATCGAACGT TTCCGTCTGA TGGGCGCCGG CGAGGGCGCC CCCGTGGCCC AGCTGTTCGG TCGCAATCGG GTGGCAGGAC AGGTGATCTG GGCCTCGCGC TTCCTCGAGA GCCAGTCCGA GAGCACGGGC GGCGGAAAGG GCGGCGGCCC GAGATCGACA GTCGTGAGCT ATTCCTATTC CGTCAGCCTC GCCGTCGCGC TGTGCGAGGG TGAGATCCTG CGCGTGGGCC GCATCTGGGC CGATGGTGCC GAGATCGCGA CGGGGTCGCT CAACCTTCGG CTCTATCGCG GGACGGAAGC TCAGCTGCCG GATCCCAAGA TCGAGGCAGT GGAGGGGGCG GGCATGGCGC CGGCCTACCG CGGGATCGCT TATGTGGTGA TCGAGGATCT CGACCTCGCC CGGTTCGGCA ACCGGGTGCC GCAATTCTCG TTCGAGGTGA TGCGGGCAGC GCAGGGGGCT CTGGCGGATC GGGTGATGAC CCTTCAGCGC GCGGTGAAGG GCGTGGCGCT GATCCCGGGC ACGGGAGAAT ACGCTCTGGC CACGAGCCGT CTCGCCTACA GCCCCGATCC GGGAGAGACC GAGGCCGCCA ACGTGCACAC GGCGTCTGGG GAAAGCGACA TGCGCAGCTC GCTTGCGCAG TTGAAGGGCG AGCTCCCGGC CTGCCGGTCG GTGTCGCTGG TCGTGTCCTG GTTCGGGAGC GACCTGCGCT GCGGCGCCTG CGAGATCCGC CCCAAGGTCG AGGGGCATCT CGACGCGGCG ACCATGGCCT GGCGGTCGGG CGGGATCGAC CGCACGCAGG CCGAACGGGT GGCGCGGATC GACGACCGTC CGATCTATGG GGGAACGCCT GCGGATGCGG CGGTGGTCGA GGCGATCCAC GCCCTGAAGG CGCAGAACTG CTCGGTGCTC TACTATCCGT TCATCCTCAT GGAACAGCTT CCCGGTAATG ATCTGCCGGA CCCCTGGTCA GGTGCTGCGG GCCAGCCGGT GCTGCCCTGG CGCGGGAGGA TCACACTCTC GACAGCGCCG GGTCAGCCGG AGACGCCTGA CAGGACGGGA GCCGCCGAGG ATGAGGTGCG CGCCTTCTTC GGCGACGCGG ACCCTCGCCA GTTCCGCATC GAGGACGGGC GCGTGATCCA TGACGGCGCC GACGACTGGC GCTACCGGCG CTTCATCCTC CACGCGGCCT GGCTCTGCCG GCTGGCCGGC GGGGTGGAGG CCTTCTGCGT GGGCTCCGAG TTGCGCGGAC TGACGGCGAT CCGGGGGGCG GACGACAGCT TCCCGGCGGT GGCCGAACTG CAGAGGCTTG CGGCCGACGT GCGCGGCATC CTCGGGCCCG ACGTGAAGAT CGGTTACGCC GCCGACTGGA GCGAATGGTC CAGCCTCCAT GCCGATGGCA ATCTCTACTT CCATCTCGAT CCGCTCTGGT CGGATCCGAA CATCGATTTC ATCGGCATCG ACAATTACAT GCCGCTCTCG GACTGGCGCG AGGCCGAGAC TCATGCGGAC GCGCGGTGGC CCTCGGTGCA GGATCTGGGC TACCTCAAGT CGAACATCGC CGGAGGCGAG GGTTTCGCGT GGTATTATCC CGACGAGGCC GCGGCGGCGG CTCAGGACCG GCGGCCGATC 
ACGGATGCGG ATTTCGGAGA GCCCTGGGTG CATCGCCCCA AGGACCTCCG AACCTGGTGG TCGCTGCCCC ATCACGAGCG CATCGGCGGG GTGCGGCTGG CGCAGCCGAC CGGCTGGGTC CCGCGCTCGA AGCCGATCTG GTTCACCGAA TATGGGTGTC CGGCGCTCGA CCTCGGGAGC AATCAGCCGA ACCTGTTCAT CGATCCCAAA AGCTCGGAAT CCGCGCTGCC GCGGGGCTCG TCCGGACGAC GCGACGACGG GATCGCGATG AACTACCTGC GCGCGATGGC CGAATACTGG CAGGACGAGG CCAACAATCC GGTGTCGGAC CTCTATGGTG GCTCCATGGT CGACATGGAC CGCGCCCATG CCTGGGCCTG GGACGCCCGT CCATTTCCGG CCTTTCCCGC GCTCGGCGAT GTCTGGTCCG ACGGGTCGAA CTATGCGCGC GGTCATTGGC TGAACGGTCG CGCAGCCTCG CAACCGCTGT CCGCCGTCGT GGCCGAGTTC TGCGAACGGT CCGGGGTGAC CGCCATCGAC GTCACGCAGT TGCAGGGCGT GGTGCGCGGC TATGGGATTT CCGAGGTGGA CAGTGCGCGG GCGGCCTTGC AGCCGCTGAT GCTGGCCTAT GGCTTCGAGG CGCTGGAGCG CGAGGGCAAG CTGATCTTCC GCATGCGCGA CGGACGCGCG AAGGCCACGG TCTCGCGCGA GGACCTGGTG GCCTCGGACG ATCTCGAGGG CGCGCTCGAG CGCGTGCGCG CGGCGGAGGT CGAGACCGCA GGCCGGGTTC GGCTGCACTT TCTCGAGGCG GAAGGCGATT ATGAGGCACG TCAGGTCGAG GCGGCCTTCC CGGACGAAGC GACCTTTGCC GTCTCGCAAT CCGAAGTGCC GCTCGTGCTC ACCTCGGGCG AGGCGCGGGG CGCCGTCGAG CGGTGGCTGG CCGAGGCCCG CGTGGCGCGC GACAGCCTGC GCCTGGCCCT GCCGCGGTCG GCTCTGGCCT TCGGTGCCGG CGACGTTCTG GAGATTGAAG GCCGGCGCTA TCGCATCGAC CGCCTCGATC AGGCCGAATG CCAGCTGGCG GAGGCGGTTC GTGTGGAACC CGGCATCTAC CGCCCCGCCG ATATGGCCGA AGAGCAGGTG CGCGCGCGTC GCATTGCGGC TTCCGGTCCG GTCCGGCCGA TCTTTCTCGA TCTGCCCCTG CTCACGGGAA CGGAGCTGCC CCATTCCCCC CACTTCGGTG TGGCAGCAAC GCCTTGGCCG GGGCAGGTTG CGGTCTGGAG CGCGACGCAG GATGCGGGCT ATCGCCTGAA CCGGCTTCTG GGCGTGGCCG CAACGGCGGG CATCACCGAG ACGATCCTGC GGGCTGCACC GATGGGGCGA TGGGACATGG GCGCGCCGCT CCGGGTGCGG CTGCAGGGGC CGCCGCTCGC CGCGGCTTCG GATCTCGAGG TGCTGAACGG AGCCAACCTC ATGGCCATCG GCGACGGCCG ACCCGACGGG TGGGAGCTCT TCCAGTTCGC GGACGCGCGG CTGGTTTCGC CGCGGGTCTG GGAGCTTTCG CGGAGGCTCC GGGGGCAGGC GGGAACGGAC GGGCAGATGC CCGAGATCTG GCCCGAGGGC AGCACGGTCG TTCTCGTCAA CGGCGCGCTG ACGCAGCTTC AGATGCCGCT GGCCGACCGC GGGCTCGCCC GGCATTACAG GATCGGTGCC GCTTCTCGCG GTTACGACGA TCCGGACACC GTCCACCGGA CCGAGGCTTT CGCGGGGGTC GGGCTCCGGC CCTATGCGCC GGTCCATCTG GCGGTGGAGC 
GGGACGGAGA GCACATCCTG CTGCGCTGGA TCCGTCGCAC CCGGATCGAC GGCGACAGCT GGGAGGGGCG CGAGGTACCG CTGGGAGAGG AGAGCGAGCT TTATCTGGTC CGCGTGGTGG CGGGGGGGAA GGTGCGCCGG CAGGAGGAGG TGCGGGAGCC CCGCTGGACA TACACCGGGG CCGCGCAGGC GGCGGATGGC GTGGGAAGCG CCTTCTTCCT CGAAGTCGCC CAGGTGTCGG AGAGCTTCGG GCCCGGGCCG TTCCGGCGCA TCGCGCTCTG A
|
Protein sequence | MATLLLAAAG SAIGAGFSGT VLGLTGAVIG RAIGATVGQL IDQRLMGAGS QTVRTGRIER FRLMGAGEGA PVAQLFGRNR VAGQVIWASR FLESQSESTG GGKGGGPRST VVSYSYSVSL AVALCEGEIL RVGRIWADGA EIATGSLNLR LYRGTEAQLP DPKIEAVEGA GMAPAYRGIA YVVIEDLDLA RFGNRVPQFS FEVMRAAQGA LADRVMTLQR AVKGVALIPG TGEYALATSR LAYSPDPGET EAANVHTASG ESDMRSSLAQ LKGELPACRS VSLVVSWFGS DLRCGACEIR PKVEGHLDAA TMAWRSGGID RTQAERVARI DDRPIYGGTP ADAAVVEAIH ALKAQNCSVL YYPFILMEQL PGNDLPDPWS GAAGQPVLPW RGRITLSTAP GQPETPDRTG AAEDEVRAFF GDADPRQFRI EDGRVIHDGA DDWRYRRFIL HAAWLCRLAG GVEAFCVGSE LRGLTAIRGA DDSFPAVAEL QRLAADVRGI LGPDVKIGYA ADWSEWSSLH ADGNLYFHLD PLWSDPNIDF IGIDNYMPLS DWREAETHAD ARWPSVQDLG YLKSNIAGGE GFAWYYPDEA AAAAQDRRPI TDADFGEPWV HRPKDLRTWW SLPHHERIGG VRLAQPTGWV PRSKPIWFTE YGCPALDLGS NQPNLFIDPK SSESALPRGS SGRRDDGIAM NYLRAMAEYW QDEANNPVSD LYGGSMVDMD RAHAWAWDAR PFPAFPALGD VWSDGSNYAR GHWLNGRAAS QPLSAVVAEF CERSGVTAID VTQLQGVVRG YGISEVDSAR AALQPLMLAY GFEALEREGK LIFRMRDGRA KATVSREDLV ASDDLEGALE RVRAAEVETA GRVRLHFLEA EGDYEARQVE AAFPDEATFA VSQSEVPLVL TSGEARGAVE RWLAEARVAR DSLRLALPRS ALAFGAGDVL EIEGRRYRID RLDQAECQLA EAVRVEPGIY RPADMAEEQV RARRIAASGP VRPIFLDLPL LTGTELPHSP HFGVAATPWP GQVAVWSATQ DAGYRLNRLL GVAATAGITE TILRAAPMGR WDMGAPLRVR LQGPPLAAAS DLEVLNGANL MAIGDGRPDG WELFQFADAR LVSPRVWELS RRLRGQAGTD GQMPEIWPEG STVVLVNGAL TQLQMPLADR GLARHYRIGA ASRGYDDPDT VHRTEAFAGV GLRPYAPVHL AVERDGEHIL LRWIRRTRID GDSWEGREVP LGEESELYLV RVVAGGKVRR QEEVREPRWT YTGAAQAADG VGSAFFLEVA QVSESFGPGP FRRIAL
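The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used by the source database: it uses the amino acid assignments of NCBI translation table 11 (identical to the standard code for sense and stop codons), it does not handle alternative start codons, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above, in the record's 10-base blocks.
print(translate("ATGGCGACGC TTCTGCTGGC CGCGGCCGGA"))  # MATLLLAAAG
```

The output matches the first ten residues of the protein sequence in the record. Whitespace is stripped before translation because the record prints the sequence in 10-base blocks.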