Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4538 |
Symbol | |
ID | 3912355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5131243 |
End bp | 5132865 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637886442 |
Product | hypothetical protein |
Protein accession | YP_488132 |
Protein GI | 86751636 |
COG category | [S] Function unknown |
COG ID | [COG5338] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGCG CGGCCGGGCA CGCCGAGGCG CAGAGCCTGA CCCGCGACCT GTTCCGGCCG GAGCGCGGCG CCTTCGTCGC GCCGCAGGAC CTGCCGCTGC AACGCACGCC GCGGCTGGCC GCCGCGCCCG ATCCCTATGC GACCGACAAT GATCCGCGCA GAGACAGAAC CACGCCGCCG CGGCTCGGAT CGCCGAATTT CGGGCTGCAG CCCAGCCTGG GCAGCGCCGG CACCGGCTAT GACGCGCTCG GCCGCAAACG CCAGAAGCCG AAGATCTTTC CCGGCGCGCC GCAGCCCAAG GCGGTCGGCC CCGGCTCAAA GCCGGTGATC GCCGCCCCGC CGCCGGCCCG TCCGCTGCCG CCGTCGCAGA ATGCCGCCAA GCCGCCGGTT CCGGCCGCCT TCACCGGAAC GCTGCCCGGC CAGCCGACCC GGCGCCGGCT CAAGCCCGAC CTCGATCCGT TCGGCTCGGT CGGCGATTAC GCCGGCAGCT TTCTGTTCAA GGGCGCGATC GAACTGAACG GCGGCTACGA TACCAATCCG GCCCGCATCA CCACGCCGCG CGCCTCCGGC TTCTACAAGA TCTCGCCCGA ATTGATGGTG ACCTCGGACT GGGAGCGCCA CGCGCTGGTT GCCGATCTGC GCGGCTCCTT CACCGGCTAT GGCCACGACT TCATCAATCC GGCCGGCGCC GTGTCGTCGG CGCCGCGCAA TCTCGACCGC CCCGATTTCA ACGGCCACGT CGACGGCCGG ATCGACGTCA CCCGCGATAC CAGGCTGCTC GGGCAGGGCC GGCTGATCGT CGGCACCGAC AATCCCGGCA GCCCGAACGT GGAGGTCGGG CTGCAGAAAT ATCCGATCTA TACCCAGACC GGCGTGTCCG GCGGCATCGA CCAGAACTTC AACCGGCTTC AGGTCACGGC GATCGGCAGC GCGGATCGTC GCGCCTTCCA GAAGTCGGTG TTCACCGACG GCAGCACCGA CACCAACAAC GACCGCAACT ACAATCAATA TGCCGGCACC GGCCGCGTCA GCTACGAGAT TCTGCCGGGG CTGAAACCGT TCGTCGAAGG CCAGGCCGAT ACGCGCACGC ACGACACCGT CACCGATCGC TACGGCTATC GGCGCGACAG CAATGGCGGC TACGTCAAGG GCGGCACCAG CTTCGAACTG ACGCGGCTAT TGACCGGCGA AGCTTCGATC GGCTGGGCAT CGCGCACCTA CAGCGATCCG CGGCTGCTGA AGCTCGACGG CCTGCTCACC AGCGCCTCGC TGATCTGGAC GATGACGCCC CTGACGACGG TCAAATTCAT CGCCGACACC AGCATCGACG AATCGCCGCT GTCCGGCGTG TCCGGCGTGC TGACGCGGAC CTACACGGCC GAGGTCGACC ACGATTTCCG CCGCTGGCTG ACCGCGATCG GCAAGTTCAC CTACGCGACC TACGATTACC AGGGCTCGGG CCGCAGCGAC CGCTTCACCT CGCTCGAGGG CAATCTGGTC TACAAGCTGA ATCGCTCGCT TTGGGTCAAA GGCACGCTGC GCCGCGACCA GCTCGATTCC AATATCGTCG GCGGCAGCTA CAATGCCACC GTCGTGATGC TCGGCGTGCG GCTGCAGAAC TGA
|
Protein sequence | MAGAAGHAEA QSLTRDLFRP ERGAFVAPQD LPLQRTPRLA AAPDPYATDN DPRRDRTTPP RLGSPNFGLQ PSLGSAGTGY DALGRKRQKP KIFPGAPQPK AVGPGSKPVI AAPPPARPLP PSQNAAKPPV PAAFTGTLPG QPTRRRLKPD LDPFGSVGDY AGSFLFKGAI ELNGGYDTNP ARITTPRASG FYKISPELMV TSDWERHALV ADLRGSFTGY GHDFINPAGA VSSAPRNLDR PDFNGHVDGR IDVTRDTRLL GQGRLIVGTD NPGSPNVEVG LQKYPIYTQT GVSGGIDQNF NRLQVTAIGS ADRRAFQKSV FTDGSTDTNN DRNYNQYAGT GRVSYEILPG LKPFVEGQAD TRTHDTVTDR YGYRRDSNGG YVKGGTSFEL TRLLTGEASI GWASRTYSDP RLLKLDGLLT SASLIWTMTP LTTVKFIADT SIDESPLSGV SGVLTRTYTA EVDHDFRRWL TAIGKFTYAT YDYQGSGRSD RFTSLEGNLV YKLNRSLWVK GTLRRDQLDS NIVGGSYNAT VVMLGVRLQN
|
| |