Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2753 |
Symbol | |
ID | 4023251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3073343 |
End bp | 3074638 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637962951 |
Product | hypothetical protein |
Protein accession | YP_569882 |
Protein GI | 91977223 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.429535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTATCGG TGGAATTGGC AATCGTTGTC GGGTTGATCG TCGCCAACGG ATTGCTGTCG ATGTCTGAGC TGGCGATCGT GTCGTCACGC CCGGCCCGGC TTGCGCTTCT GGCTCAGAAG AACGTGCGCG GTGCCCGGCA GGCGATGAAG CTGGCCGAAG ACCCCGGCAA ATTCCTGTCC ACCGTCCAGA TCGGCATCAC TTTGGTCGGC GTGCTGTCCG GCGCGTTCTC CGGCGCCACG CTCGGCCAGC GCCTGACCGA ACAACTGAGC GAGCACAATG TGCCGTTCCC GGACGTCATC GGCTTCGGCC TGGTGGTCAC GCTGATCACC TACGCGACCT TGATCGTCGG CGAATTGGTG CCGAAGCAGG TCGCGCTGCG CGATCCCGAA GCTGTGGCGG TCAAGGTCGC ACCGGCGATG ACGTTGATCG CCAAGGTCTC GCGGCCGGTG GTGTTCGTCC TCGATCTCTC CGGCAAGGCG ATCCTGAAGA TTCTCGGTCA GGGCGGCGCA GCCGAGGAGA AGGTTTCCGA GGAAGAGATC CACAATCTGG TGATGGAAGC GGAGACCGCC GGTGTGCTGG AGCCCGGCGA GCGCGAGATG ATCGCCGGCG TCATGCGGCT CGGCGACCGG CCGGTCGGCG CGGTGATGAC GCCGCGACCG GATGTCGACC TGATCGATCT CGGCGATCCG CCCGATGTGA TCCGCGACGC CTTCCTCAAC AGCCCGCATT CCCGGCTGCC GGTCACCGAC GGCGATCGCG ACAATCCGAT CGGGATCATC CAGGCCAAGG ACATGCTGGG AATCTATCTG CGCGGCGACA AGCCGGATCT GCGCGCGGCG GTGCGCGACG CGCCGGTGAT CCCGTCCTCC GCCGACGCTC GCGACGTGCT GGCGACGCTG CGGCAATCGT CGGTGCATAT GGCGTTGGTC TATGACGAAT TCGGAGCGTT CGAGGGCGTG GTCACGACCG CCGATATTCT CGAATCGATC GTCGGCGCAT TCGGATCGGA AGACGGTCCG CCCGAGCCCG CCGCGGTGCT CCGCCAAGAC GGTTCCTATC TGATCGCGGG CTGGATGCCG GTGGATGAAT TCGGCGATCT GCTGCCAGTC TCGATCCCGC CGAATCGCGA CTACCACACC GTCGCGGGCC TCATTCTGCA GCATTTCGGC GCGCTGCCGG TGGTCGGCGA CAGGTTTGAC TATGAGGGCT GGGAGATTGA AATTCTCGAT CTCGACGGCC GGCGGATCGA CAAGATCATG GCGACCCGGA GCGCAAACGC GGAGGCTGTG GCGTGA
|
Protein sequence | MLSVELAIVV GLIVANGLLS MSELAIVSSR PARLALLAQK NVRGARQAMK LAEDPGKFLS TVQIGITLVG VLSGAFSGAT LGQRLTEQLS EHNVPFPDVI GFGLVVTLIT YATLIVGELV PKQVALRDPE AVAVKVAPAM TLIAKVSRPV VFVLDLSGKA ILKILGQGGA AEEKVSEEEI HNLVMEAETA GVLEPGEREM IAGVMRLGDR PVGAVMTPRP DVDLIDLGDP PDVIRDAFLN SPHSRLPVTD GDRDNPIGII QAKDMLGIYL RGDKPDLRAA VRDAPVIPSS ADARDVLATL RQSSVHMALV YDEFGAFEGV VTTADILESI VGAFGSEDGP PEPAAVLRQD GSYLIAGWMP VDEFGDLLPV SIPPNRDYHT VAGLILQHFG ALPVVGDRFD YEGWEIEILD LDGRRIDKIM ATRSANAEAV A
|
| |