Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_3239 |
Symbol | |
ID | 5085988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009429 |
Strand | + |
Start bp | 102538 |
End bp | 104547 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640484811 |
Product | hypothetical protein |
Protein accession | YP_001169428 |
Protein GI | 146279270 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.983308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.678486 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAGAT TGGATGTGCG CGATTTTGCA GACAGGGATG ATGTTGTTCC GCCGGGGGGG CGGCCGGCGT TTGATGAACT CGAGAGCTAC TGGGCCGGCC TGAACCGCAT CCGCGACGCC CTGCTGGAAC TTACGGGAGA CGCGGCGCCG GCGCTTCGGG CGCAGCTTGA CCGCATCATC GAACGGATCG ACGCCTTCGA GCCCTCGGTC AGCGTGCTGG GTCAGGTCAA GGCCGGCAAG AGCACGCTTC TGAACGCGCT GATCGGCCGG CCGGGCCTGC TGCCTTCGGA CGTCAATCCC TGGACCTCGG TCATCACCAA CGTTCACCTG AACTCGCCCC GCCGCCCGCT CGAGACCCGG GCGCTCTTCC GCTTCTTCGA CGCGCTGGAA TGGGACCGGC TGGTCACCAC GGGCGGGCGC CTTGGCGAGA TGGCGCGGCG GGCCGGCTTT GCGTCCGAGG CCGAGCAGAT CCGGGCGCAG GTGATGGCGA TGCGCGGCAC GACCGAGGCT CGCCTCGGCC AGGACTTCGA GAGGTTGCTT GGCAGCAGCC ACGCCTTCCC GGACCTCTCG AGCGACGTGA TTGACCGCTA CATCTGCTAC GGCGACGCGG GCGACTCGAA GGGCCGCGCG CAGGGCTTCT ACGCCGACAT CACCAGGACG GCGGATCTTT TCGTCGATCT GCCGGGCTAT CCGCAGAACC TCTGCATCCG CGATACGCCC GGCGTGAACG ACACCTTCAT GATGCGCGAG CAGATCACGC TGAACGCGAT CAGCGAAAGC CGCGTCTGCG TGATCGTCCT GTCGGCGCAC CAGGCGCTCT CGACGATGGA CATGGCCTTG CTGCGCATCA TCGGCAATGT CGAGTCGCGC GAGGTGCTGA TCTTCGTCAA CCGCATCGAC GAACTGGCCG ATCCGGTGGC GCAGTGCCGC GAGATCGAGG ACGCGATCCG CCACACGCTC GAGCGGGCCC GGATCGGTGC CGGAATGACC ATCCTCTTCG GCAGCGCTCT CTGGGCGGTT CGCGCGCTCG AGGATCGGTG CGACGCGCTC TCCGAGCAGA GCCGGCGTGC CCTGCTCGCC TGGACGGCGG GGCGGTCGGA GGCGCGCGAT CTCCGCTCTC TGGCCTTCGA GGCGTCGGGC GTGCCTGCGC TTCATGCCGA GATTTCCCGC CGGATCGTCG AGGGGCCGGG GCAGGCCATG CTGGCCGACA TCCAGGCCGA GACCGGGAAC GTGATCTCGG AAGTCGAGAC GGTGGACATC GTTGCGCGCA GCGGCGCGAT GCCCCCGGCG GCGGTCGATC GCCCAGCGAT CGAGTCGCGG GCGGCTGAGA TCGGCCGCCT GTGCGACGAG CGGCTCAAGG CGCGGATCGT CGAGGCGCGG GCGGCGCTCT CCGAGCGGCT TGGCCGCGCG CACGAGCAGT TTGTTGAAGG GGCCGTCGGC GCACTGGCCT CGCACATCTC CGCCTACGGC GAGACCGAGG GCTGGCGCTG CGATCCCACA ACGCTGCGCA TGATGATCCG CTCTTCCTAC CTGGCTGCCG CCAAGACGGT CCGGGATGCG CTTGGCGACT GCACGGGCGA GGCGGCCGAA GGCTTCGCGC GGTTGCTCTG CGGCGATCTC GGGGTGCCGG AGGGAAGCGT GGCCTTCCAC CTGCCGCAGG GGCCCGTGCC GCAGGCGCCG GGGATCATCG CCCAGACCAT CTCGCTCGAC ATGCAGGAGG CCTGGTGGCG GCGCTTCTTC CAGCGGCGGG GATCGCCGGA GGATCGGGCC AATCGCTACC GCGCCCTGAT CATGGCCGAG ACGCTGCCCC TGACCGAGGC GGTGGGCCAG GAGGATTTTG CCGGTTTCTG CCAGGAGGTC GCCGAGGCCG CATGCGTCTT CTGCCGCAAG CAGACCGAGT TCGTGGCGGC CGTTCTCGAT GCGATGGAGC GGCCGGCCGC CGGGCAGGAG GGGGCGCTGG CCGACGTGGC AGGCGCGCTT GAGGATGCGA CAAGAAGGGG AGCGGCATGA
|
Protein sequence | MRRLDVRDFA DRDDVVPPGG RPAFDELESY WAGLNRIRDA LLELTGDAAP ALRAQLDRII ERIDAFEPSV SVLGQVKAGK STLLNALIGR PGLLPSDVNP WTSVITNVHL NSPRRPLETR ALFRFFDALE WDRLVTTGGR LGEMARRAGF ASEAEQIRAQ VMAMRGTTEA RLGQDFERLL GSSHAFPDLS SDVIDRYICY GDAGDSKGRA QGFYADITRT ADLFVDLPGY PQNLCIRDTP GVNDTFMMRE QITLNAISES RVCVIVLSAH QALSTMDMAL LRIIGNVESR EVLIFVNRID ELADPVAQCR EIEDAIRHTL ERARIGAGMT ILFGSALWAV RALEDRCDAL SEQSRRALLA WTAGRSEARD LRSLAFEASG VPALHAEISR RIVEGPGQAM LADIQAETGN VISEVETVDI VARSGAMPPA AVDRPAIESR AAEIGRLCDE RLKARIVEAR AALSERLGRA HEQFVEGAVG ALASHISAYG ETEGWRCDPT TLRMMIRSSY LAAAKTVRDA LGDCTGEAAE GFARLLCGDL GVPEGSVAFH LPQGPVPQAP GIIAQTISLD MQEAWWRRFF QRRGSPEDRA NRYRALIMAE TLPLTEAVGQ EDFAGFCQEV AEAACVFCRK QTEFVAAVLD AMERPAAGQE GALADVAGAL EDATRRGAA
|
| |