Gene Information |
Locus tag | RPD_2000 |
Symbol | |
ID | 4022482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2236920 |
End bp | 2238320 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637962193 |
Product | hypothetical protein |
Protein accession | YP_569136 |
Protein GI | 91976477 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
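The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Length in aa, assuming the final codon is an untranslated stop."""
    return gene_len_bp // 3 - 1


length = gene_length(2236920, 2238320)
print(length)                  # 1401
print(protein_length(length))  # 466
```

Both results agree with the Gene Length (1401 bp) and Protein Length (466 aa) fields.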
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.203804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA CACCGACCGA ACGGCTGAGG GAGTACCTCG CCCAGCTCCC GCCTCAGTCG CAGGCGCTGC TGATGCGGGA GTTCGAGCGT GCGCTGGAGC GTGGCGACGA GGTCGCCGTG GCCAGCTTCG TGCTCGAAGA GCTGCGCAAG ATCGTCCGCG GTTCCGATGA AGAATCCGCG CCGCGGACCG ACGATCCGGC GCGGTTGATG TTCCGCTCGC TCGAGCCGTT TCTGATCGAC AACAGCCAGC AGCCTCGGCC GGGCCAGATC CGCCGCGCGT CGCTGAGCTC GATCTGGCAA TGGCTGGTGA GCGAGGGAAT TCCCACGCCA GTCCGGGAAT TCGAAGCCGA CCTGATCAGG TTGCGCAAGG GCTCCGCCGT CGAGATCGAC GCGCTGGTCC GCAAGCTGCA GGCCGTGGCG GCCGAGGCGA TCGACAAGGT GATCAACCCG GAGCCGGGGA TCGACCGGCA GCGCGCGATG GCGCGGGTAG GGCCGCCATC GGCGGTCGAG GATCTGCCGG CTATCGGGGC GGTGCTCAAG AACCGCGAGG CGCTCGAAAC CTTCGACGCC AAGCTGTCGT CGAATCTCAG GGCGTTCGGC GACTCGCAGG TCACGTCGAT GATTGCGTCG CTCAACGTTC CGGCGCTGCA AACCCCGACC ATGCTGCCGT TCGCGCTGAC GATGATCCTC GCCCATTTGA CCCAGCCGTG GCAGATCGTC CGGCTGGCGC TCAAGGTCGC CGGCTCCGAC GACGAGATCA GGGTCGCCGC CACGCCCTAT GGCGTCGCCG TCACCATGGC GATCCACGAC GTCGCCCAGC TCACCGCCGA CCTGCGCGAC GAGATCAAGC GCGGCCATTA CAGCAATGTC GCCGAGAAAC TGAAACTGGT CCATGACGGC GTGCGCGGGC TGCGGACCGA ACTCGATATC CGCAGCGACT CGACCTGGGG CAAGCGGCTC GCGGCGATCC GCGTCGACAT TTCCAACGCG CTGAAATCCG AGATCGAAAG CGTTCCGGGC CGTGTGCGCC GGTTGCTGCG GCAACGGCCC GACAAGGAGA TCTCGGCCAA CAGCCGGATC GACCAGATCG AGGTCGATGA AGCCGCGGCG CTGATCGACT TCGTTGCGAT CTGTCGCAAC TACGCGAGCG AACTGGCGAT CAACGAGATG ACGTTGCGGA CCTATTCCGA GCTGCAGCAA TATGTCGAGA AGTCCACCGA GGCGCTGGTG CAGTCGCTGC GCGGCTGCGA TCCGCGGGTG AAGCCGTTCC GGCACATGCA GGCGCTTGCC GCGATCCGGT TCTGCGAAGT GCTGTTCGGC CACGACTACG GCCAGCTGAT GCGCCGGGCG GTGGAAAGCG CGATGGTCGT GGTCGACCGC AAGCCGGCCC GGGCGGGGTA A
|
Protein sequence | MSQTPTERLR EYLAQLPPQS QALLMREFER ALERGDEVAV ASFVLEELRK IVRGSDEESA PRTDDPARLM FRSLEPFLID NSQQPRPGQI RRASLSSIWQ WLVSEGIPTP VREFEADLIR LRKGSAVEID ALVRKLQAVA AEAIDKVINP EPGIDRQRAM ARVGPPSAVE DLPAIGAVLK NREALETFDA KLSSNLRAFG DSQVTSMIAS LNVPALQTPT MLPFALTMIL AHLTQPWQIV RLALKVAGSD DEIRVAATPY GVAVTMAIHD VAQLTADLRD EIKRGHYSNV AEKLKLVHDG VRGLRTELDI RSDSTWGKRL AAIRVDISNA LKSEIESVPG RVRRLLRQRP DKEISANSRI DQIEVDEAAA LIDFVAICRN YASELAINEM TLRTYSELQQ YVEKSTEALV QSLRGCDPRV KPFRHMQALA AIRFCEVLFG HDYGQLMRRA VESAMVVVDR KPARAG
|
| |
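The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code, and it strips the spaces that separate the 10-base blocks in this record. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first 30 bases of the gene are used as input here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


prefix = "ATGAGCCAGA CACCGACCGA ACGGCTGAGG"  # first 30 bases of the gene
print(translate(prefix))  # MSQTPTERLR
```

The translated prefix matches the first 10 residues of the protein sequence. Applying `gc_content` to the full gene sequence would give the value that the GC content field reports as a rounded percentage.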