Gene Information |
Locus tag | RPB_4031 |
Symbol | |
ID | 3911838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4600978 |
End bp | 4602399 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885935 |
Product | PucC protein, chlorophyll MFS exporter |
Protein accession | YP_487635 |
Protein GI | 86751139 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.344544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAG TCAGCCAAAA AATGATGAGA GTCTGGGCCT CTATGGGGTC TCGCTTTCTG CCGTTCGCGG ATGCGGCGAC GCCGGATCTG CCGCTGTCGC GGTTGCTGCG CCTGTCGCTG TTTCAGGTGG CGGTCGGGAT GTCGCTGGTG CTGCTGGTCG GCACTTTGAA CCGCGTGATG ATCGTCGAAC TCAACGTGCC GGCCTCGATC GTCGGCGTGA TGGTCTCGCT GCCTTTGCTG TTCGCGCCGT TCCGCGCGCT GATCGGTTTC AAATCCGACG TCCACAAATC CGTGCTCGGC TGGCGGCGCG TTCCCTTCCT CTATAAGGGC ACGCTGGTGC AATTTGGCGG CCTGGCGATC CTGCCGTTCG CGCTCCTGGT GCTGTCCGGC AGCGGCGATG CGGGCAACGC CCCGGTGTGG ATCGGACAAT TCGGCGCGGC GCTGGCGTTC CTGCTGATCG GCGCCGGCGT TCACACCACC CAGACCGTGG GCCTTGCGCT CGCCACCGAC CTCGCCTCGC CGGAATCGCG GCCGAAGGTC GTCGGCCTGA TGTACACCAT GCTGATGTTC GGCATGATCG CCGCGGCGAT CGTGTTCGGC ATGCTGCTCG CTGATTTTTC GCCCGGCCGG CTGATCCAGG TGATCCAGGG CTCGGCCGTC GTCACCATCG TTCTCAACGG CATCGCCGTC TGGAAGCAGG AAGCGCGGCG CACCTCCGGC GCGACGCAGG CGACCGCGCA TCCCGGCGCG CCCTCCGCGA GCTTCCGCGA ATCCTGGGAC GTCTTCATCC AGGGCAAGGA CGCGATGCGC CGGCTGATCG CGGTCGGCTT CGGCACCATG GCGTTCAGCA TGGCGGACGT GCTGCTCGAA CCCTATGGCG GCCAGATCCT GTCGATGTCG GTCGGCGACA CCACCAAGCT CACCGCGGCG CTCGCGGTCG GCGGTCTGCT CGGCTTCGGC CTCGCCTCGC GCGTGCTGAG CCGCGGCGCA GATCCGTTCC GGATGGCGAG CTTCGGCTCG CTGGTCGGCA TTCCGGCCTT TCTCGCGGTG ATCTTCGCCG CCGAACTGCA GGGCGCCGCG TCGGTGCTGA CATTCGGCTG CGGTACCGCG CTGATCGGCT TCGGCGCCGG CCTGTTCGGC CACGGCACGC TGACCGCGAC GATGAACGCC GCGCCGAAGG ACCAGGCCGG CCTCGCGCTC GGCGCCTGGG GCGCGGTGCA GGCCTCCGCG GCGGGCGTGG CGATTGCGCT CGGCGGCATC ATCCGGGATC TCGTGACGGC GTTCGCTCCG CAGTTCGGCC CGGCCGCGGG TTACAACGCC GTCTACGGCC TCGAACTGCT GCTGTTGCTG GCGACGCTGG CGACGATGGT CCCGCTGATC AAGCGACGGG ACACATTGTT GATGCAGGGA CAACTGAACT GA
|
Protein sequence | MNAVSQKMMR VWASMGSRFL PFADAATPDL PLSRLLRLSL FQVAVGMSLV LLVGTLNRVM IVELNVPASI VGVMVSLPLL FAPFRALIGF KSDVHKSVLG WRRVPFLYKG TLVQFGGLAI LPFALLVLSG SGDAGNAPVW IGQFGAALAF LLIGAGVHTT QTVGLALATD LASPESRPKV VGLMYTMLMF GMIAAAIVFG MLLADFSPGR LIQVIQGSAV VTIVLNGIAV WKQEARRTSG ATQATAHPGA PSASFRESWD VFIQGKDAMR RLIAVGFGTM AFSMADVLLE PYGGQILSMS VGDTTKLTAA LAVGGLLGFG LASRVLSRGA DPFRMASFGS LVGIPAFLAV IFAAELQGAA SVLTFGCGTA LIGFGAGLFG HGTLTATMNA APKDQAGLAL GAWGAVQASA AGVAIALGGI IRDLVTAFAP QFGPAAGYNA VYGLELLLLL ATLATMVPLI KRRDTLLMQG QLN
|
| |
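The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that Protein Length excludes the stop codon; both assumptions agree with the numbers shown but are not stated by the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the stop codon is not counted."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split a displayed sequence into 10-letter blocks."""
    return "".join(seq.split())


length = gene_length(4600978, 4602399)
print(length)                  # 1422
print(protein_length(length))  # 473
print(strip_blocks("ATGAACGCAG TCAGCCAAAA")[:9])  # ATGAACGCA
```

Both results match the Gene Length (1422 bp) and Protein Length (473 aa) fields. The first three codons of the displayed gene sequence, ATG AAC GCA, correspond to the leading residues M, N, A of the protein sequence.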