Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0118 |
Symbol | |
ID | 3908089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 129405 |
End bp | 131093 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637882000 |
Product | hypothetical protein |
Protein accession | YP_483741 |
Protein GI | 86747245 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGCA GGCCGAAGCG TGGCGCATTT TCCGCCGGCG GCGTCGCGGC GCCGCTGCGC TATGCGCTGT TCCGGCGGAT CTGGCTGGCC AGCCTGCTGT CCAATCTCGG CCTGATGATC AACGGCGTCG GCGCCGCCTG GGCGATGACC CAGATGACGT CCTCGGCCGA CAAGGTGGCC TTGGTGCAGA CCGCCCTGAT GCTGCCGATC ATGCTGGTGG CGATGCCGGC CGGCGCGATC GCCGACATGT ATGACCGCCG GCTGGTCGGG CTGGCGTCGC TGACGCTCGG CCTCGGCGGC GCCACGGCGC TCGCGGTGCT GGCGCATCTC GGCCTGGTGA CGCCGGAGAT CCTGCTCGCC TTCTGCTTCG TGATCGGCAC CGGCATGGCG CTGTTCGGCC CCGCCTGGCA GGCCTCGGTC AGCGAGCAGG TGCCGGGCGA GGCGTTGCCG GCCGCGGTGG CGCTGAACGG TATCAGCTAC AACATCGCCC GCAGCTTCGG CCCCGCGGTC GGCGGCATCG TGGTGGCAAG CGCCGGCGCG GTCGCGGCAT TCGCGGCGAA TGCGGTGCTG TATCTGCCGC TGCTGGCGGT GCTGTTGCTG TGGCGGCGGG TCAGCGAGCC GCCGCGGTTG CCGCCGGAGC GGCTCAACCG CGCGATCGTG ACCGGCGTGC GCTACATCGC CAACTCGCCG TCGATCCGGA TCGTGCTGGC GCGAACGCTG ATCACCGGCA TCGCCGGCTC GTCGGTGCTG GCGCTGATGC CGCTGGTGGC GCGCGACCTG CTGAAGAGCG GCGCCGAGAC CTACGGCATC CTGCTCGGCG CGTTCGGGGT CGGCGCGGTG ATCGGCGCGC TCTATGTCGG GATGGTGCGC GAGCGGATGA GCAGCGAGGC CGCGATCCGC AGCTGCGCGC TGATCATGGG CGTGGCGATG GCGGCGGTGG CGATGAGCCG CTGGTCGATA CTCACCGCCG CGGCGCTGGT GATTGCGGGC GCGGTGTGGA TGCTGGCGAT CGCGCTGTTC AACATCGGCG TGCAGCTGTC GGCGCCGCGC TGGGTCGCCG GCCGCTCGCT GGCGGCGTTC CAGGCCTCGA TCGCCGGCGG CATCGCGATC GGGAGCTGGG GCTGGGGCCA CGTCGCCGAT CTCTCGGGCG TCGCGCCGTC GATGCTGATC TCGGCGGGCG CCATGGTCGT CTCGCCGCTG CTCGGGCTGT GGCTGCGGAT GCCGGCGGTC GGTACCCAGA TCGAGGACGC CGACCTCCTG GCCGACCCGG AAGTGCGGCT GGCGCTGACG CCGCGCAGCG GACCGCTGGT GGTCGAGATC GAGTACCGCG TCGATCCCGA CGACGCCCGC GCGTTTCACG GGGTGATGCA GCAGGTGCAG CTCAGCCGCC AGCGCAACGG CGCCTATGGC TGGTCGATCG CCCGCGACAT CGCCGATCCG GAATTGTGGA CCGAGCGCTA TCACTGCCCG ACCTGGCTCG ACTACTTGCG GCAGCGCAAC CGCTCGACCC AGGACGATCG CAATCTGCAT CAGCGCGCGA TGGCGTTCCA CCGCGGCGCA GACCCGATCC GGGTCCGCCG GATGCTGGAG CGGCCGTTCG GATCGGTGCG CTGGAAGGAC GATTCGCCCG ATCGCGCCAC CGGCACCGAA GTGCTGCCGG TCGCCGGCGT CAGCGGCGGC TCGACCTGA
|
Protein sequence | MTGRPKRGAF SAGGVAAPLR YALFRRIWLA SLLSNLGLMI NGVGAAWAMT QMTSSADKVA LVQTALMLPI MLVAMPAGAI ADMYDRRLVG LASLTLGLGG ATALAVLAHL GLVTPEILLA FCFVIGTGMA LFGPAWQASV SEQVPGEALP AAVALNGISY NIARSFGPAV GGIVVASAGA VAAFAANAVL YLPLLAVLLL WRRVSEPPRL PPERLNRAIV TGVRYIANSP SIRIVLARTL ITGIAGSSVL ALMPLVARDL LKSGAETYGI LLGAFGVGAV IGALYVGMVR ERMSSEAAIR SCALIMGVAM AAVAMSRWSI LTAAALVIAG AVWMLAIALF NIGVQLSAPR WVAGRSLAAF QASIAGGIAI GSWGWGHVAD LSGVAPSMLI SAGAMVVSPL LGLWLRMPAV GTQIEDADLL ADPEVRLALT PRSGPLVVEI EYRVDPDDAR AFHGVMQQVQ LSRQRNGAYG WSIARDIADP ELWTERYHCP TWLDYLRQRN RSTQDDRNLH QRAMAFHRGA DPIRVRRMLE RPFGSVRWKD DSPDRATGTE VLPVAGVSGG ST
|
| |