Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_4052 |
Symbol | |
ID | 5086225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 91268 |
End bp | 94330 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640485615 |
Product | hypothetical protein |
Protein accession | YP_001170209 |
Protein GI | 146280052 |
COG category | [S] Function unknown |
COG ID | [COG4457] Uncharacterized protein conserved in bacteria, putative virulence factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0348037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.215609 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCCG ACAGGAAAGA ACGCCTCCGC CTTCTGGTCA ATTGGCACGA CGAGATCACG CTGGTGCCCT TCTCGGGCAT CCAGATCCTC GATTTCGGCT TCCGCATGGA TGCGCTGACG CTGCGCCCGC TGCGCTTCAT CGAGCGCACG GTCAGCGCCG GGCCCGACCG GAGCGAGCGG ATGCTGATCC CGCTCAGCGG CCGCGAGGAG CATGACGCGC CGATCGAATC CGACGCCCGT CCCGACGACG ACGAATATTC CATCCGCCCC ACCGCCGCGC TCGAGCCCTT CCTGGCCAAG TGGGTGCCGG TGCCGGTGCT GCGCATCAAG AGCGAGCGCG GGCCGGGGGG CGAGGAGCGG TTCGATCCGG GCCCCTCGAG CTGGGCGCGG ATGCGCACGG TGGAGCTGGC CGAGCCCGAT CCCGAGACCG GCTTCACCCA CCGGGTGCAG CTGGCCCTCG ACACCACCCT CGTCGCCCAG GACCAGAGCC GCCACTATGT CGCCCCCGAG CGCGCCGACG CCGAGAAGCC GCGCGACTTC CGCTTCGTCT CGGATCCGGC GGTGATGGAC TGGTTCCTGC GCCGGCTCGA GGAGGGCGAC GACGGCACGA TGATCGACCT GCAGCTCTGG GCCTCGGACT GGCTGAAGGA GCTGTTCCTC GCCTTCAAGC GCGCCGAGCG GCCGGGCCGC ACCGTCACCG AGGACAGCCT GCCGCACCAG TTCGAGCACT GGGCGCGCTA CCTGGCCTAT CTGCAGACCA TCGACCACGC CGTCCGGGTG CCGCGGATGC GCTTCGTCAA CACCGTCTCC GAGCGCGATG CGGTGACGCC GGTCGATGTG GATCTGGTGC TCGACGTGGG CAATTCGCGC ACCTGCGGCA TCCTGATCGA GCGCTTTCCC GGCGAGGGGC GGGTGGATCT GGTGCGCTCC TTCCCGCTCG AGATCCGCGA CCTCTCGCGC CCCGAGCTGC ATTATTCCGG CCTGTTCGAG AGCCGCGTGG AATTCGCCGA GCTGAAGTTC GGCGAGGATC ATTTCGCCAG CCGCTCGGGC CGGCGCAACG CCTTCGTCTG GCCGGGTTTC GTCCGGGTGG GCCCCGAGGC GCTGCGGCTG ATCCAGGGCG AGGAGGGCAC CGAGACCTCC TCGGGCCTCT CCTCGCCCAA GCGCTACATC TGGGACGACG AGGCGCGCCA GCAGGACTGG CGCTTCCACA ACCACCACGA TCCCAACAAC CTGCCCAAGT CGGTGCGCGC GGCCATGCGC CAGCTGAACG AGGCGGGCGA CGTGCTCGAA CAGGTGCGCT ACGAGGAGAG CCGGCGCCTG CGCCCGCGCG GCCGGACCGC GCCGGCGCGC GCCATCCGGC CGCGCTTCTC GCGCTCGGCG CTCTTCGGCT TCATGCTGGC CGAGATCATC AGCCATGCGC TTGTGCAGGT GAACGACCCG GCCTCGCGGT CGCGGCGGGC GCAGTCGGAC CTGCCGCGGC GGCTGAACCG GATCATCCTG ACCCTGCCCA CCGCCACCTC GGTGCAGGAG CAGGCCATCA TCCGCTCGCG CGCCGAGGGG GCGCTCCGTC TGGTCTGGAG CACGCTGGGC GTGGCCGACA CCGAGACGAG CATCTCGCGA AGGCCCGAGC TGATCGTGGA ATGGGACGAG GCGAGCTGCA CGCAGCTCGT CTATCTCTAC AGCGAGCTGA CGCAGAAGTT CGACGGCAAC ATCAACGCCT TCCTCCAGCT CAAGGGCCAT CCGCGGCGGC GGGCGGGCGC GGCCGAACCC GCGCCGAGCC TGCGGCTGGC CTGCATCGAC ATCGGCGGCG GCACCACCGA CCTGATGATC TGCACCTACT GGGGCGAGGC CAACCGGGTG CTGCATCCCG AGCAGACCTT CCGCGAGGGC TTTCGCGTGG CGGGCGACGA TCTGGTGCAG CGGGTGATCT CGGCCATCAT CCTGCCGCGG CTGCAGGCCT CGATCGAGGC GGCGGGCGGG CGCTATGTCG GCGAGAAGGT GCGCGAGCTC TTTGCCGGCG ACATCGGCGG GCAGGACCAG CAGGTGGTGC AGAAGCGCCG CCAGTTCGCG CTGCGGGTGC TGATGCCGCT TTCGGTGGCG ATCCTCGCCC ATTGCGAGAC CGCCGACGAA TTCGACCGCT TCGACCTCGA GGTGGGGTCG GCGCTGCGGC CGGCGGTCTC GCAGGAGATC CTCGGCTATC TCGAGGGGGC GGCGCGCGAT CTGGGCGCCG CGGGCTGGAG CCTGGCCGAC GTGGTGCTGA CGGTCTCGCG CGAGGATGTG GACGCCATCG CCCGCGAGGT GTTCCAGAAG GTGCTGGGCA ACATGGCCGA GGTGATCGAC CATCTGGGCG TGGATGTGGT GCTGCTGACG GGCCGCCCCT CGCGGCTGCC CGCGGTGCGC GCCATCGTCG AGGAGATGCT GGTGGTGCCG CCTCACCGGC TGGTCTCGAT GCACCGCTAC AAGACCGGCC GCTGGTATCC GTTCCGCGAC CCGATCACCC AGAGGATCGG CGACCCCAAG AGCACGGTGG CGGTGGGGGG GATGCTCATC GCCCTGTCGG AAAGCCGGAT CCCGAACTTC AAGGTCTCGA CCGGGGCCTT CCGCATGCGC TCGACCGCGC GCTTCGTCGG CGAGATGGAC AGCAACGGCC AGATCCGGGA CGAGCGGATC ATGTTCTCGG ATCTCGATCT CGACGCGGCC CGGCCCGGCA CGCAGCAGAC CGCGCTGGTG CGGATGTTCG CCCCGATCCA CATCGGCTCG CGCCAGCTTC CGCTCGAACG CTGGACGACG ACGCCGCTGT TCCGGCTCGA CTATGCCAAC GCGGCCGCGC AGCGGCGGCC CTCGCCCATC CTCGTGACCT TCGAGAAGGC CGAGTTCGAC GACGGCGAGG CCGAAACCTC GGAGGACCGG CTGCGGCGCG AGGCGCAGCG CGAGTTCCTG AGGATCACCG AGGTGGAGGA CGGCGCCGGG GACGGCATGA AGACCTCCGA CCTGTGCCTC AAGCTGCACA CGCTGGGGTT GGACGACGAA TACTGGATCG ACACCGGGGT CTTCCAGTAC TGA
|
Protein sequence | MIADRKERLR LLVNWHDEIT LVPFSGIQIL DFGFRMDALT LRPLRFIERT VSAGPDRSER MLIPLSGREE HDAPIESDAR PDDDEYSIRP TAALEPFLAK WVPVPVLRIK SERGPGGEER FDPGPSSWAR MRTVELAEPD PETGFTHRVQ LALDTTLVAQ DQSRHYVAPE RADAEKPRDF RFVSDPAVMD WFLRRLEEGD DGTMIDLQLW ASDWLKELFL AFKRAERPGR TVTEDSLPHQ FEHWARYLAY LQTIDHAVRV PRMRFVNTVS ERDAVTPVDV DLVLDVGNSR TCGILIERFP GEGRVDLVRS FPLEIRDLSR PELHYSGLFE SRVEFAELKF GEDHFASRSG RRNAFVWPGF VRVGPEALRL IQGEEGTETS SGLSSPKRYI WDDEARQQDW RFHNHHDPNN LPKSVRAAMR QLNEAGDVLE QVRYEESRRL RPRGRTAPAR AIRPRFSRSA LFGFMLAEII SHALVQVNDP ASRSRRAQSD LPRRLNRIIL TLPTATSVQE QAIIRSRAEG ALRLVWSTLG VADTETSISR RPELIVEWDE ASCTQLVYLY SELTQKFDGN INAFLQLKGH PRRRAGAAEP APSLRLACID IGGGTTDLMI CTYWGEANRV LHPEQTFREG FRVAGDDLVQ RVISAIILPR LQASIEAAGG RYVGEKVREL FAGDIGGQDQ QVVQKRRQFA LRVLMPLSVA ILAHCETADE FDRFDLEVGS ALRPAVSQEI LGYLEGAARD LGAAGWSLAD VVLTVSREDV DAIAREVFQK VLGNMAEVID HLGVDVVLLT GRPSRLPAVR AIVEEMLVVP PHRLVSMHRY KTGRWYPFRD PITQRIGDPK STVAVGGMLI ALSESRIPNF KVSTGAFRMR STARFVGEMD SNGQIRDERI MFSDLDLDAA RPGTQQTALV RMFAPIHIGS RQLPLERWTT TPLFRLDYAN AAAQRRPSPI LVTFEKAEFD DGEAETSEDR LRREAQREFL RITEVEDGAG DGMKTSDLCL KLHTLGLDDE YWIDTGVFQY
|
| |