Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2976 |
Symbol | |
ID | 3910775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3387666 |
End bp | 3388655 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637884882 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_486589 |
Protein GI | 86750093 |
COG category | [R] General function prediction only |
COG ID | [COG0491] Zn-dependent hydrolases, including glyoxylases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.340706 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATGA CCCGACGTCA CGCCCTGACT GCCGCCGCCG CACTCGCGGC CGCGCCGTTG CTGCGCAGTG CTCCCGCCGA GGCGGCCGCC CCGCTCGCCG ACAAGCAGGC GCCGAGCTTC TACCGTTACA AGGTCGGCGA CGCACAGGTG AACGTGATCT CCGACGGCGT GAACAGCTTT CCGCTCGGCG ACAGCTTCGT GCTCAACGCC AAGAAGGACG AGGTCAACAA GGCGCTCGAA GCGGCCTTCC TGCCCAAGGA CAGGATCTCG ATCAACTTCG CCCCGCTGGT GATCAACACC GGCGGCAAGC TGGTCGTGGT CGACACCGGC AACGGCCCCG CCGCGTTTGC GTCGAGCAAG GGCAATGTCG GGCAGTTCGC CGGCAACATG GCGGCCGCCG GCCTCGATCC CAAGGCGGTC GACATCGTGG TGATCTCGCA TTTCCACGGC GACCACATCA ACGGCCTGCT CGGCGCCGAC AACCAGCCGG CGTTTCCCAA TGCCGAGGTG CTGGTGCCGG CGGCGGAGTG GAAGTACTTC ATGGACGACG GCGAGATGAG CCGCGCCTCC GGCGAGCGGA TGCAGGGCGT GTTCAAGAAC GCCCGCCGGG TGTTCGAGGC CGGGCTGAAC AAGAAGGTCA CGCCGTATGA ATGGGGCAAG GACGTCGCAC CCGGCCTGCT CGCGGTGGAA TCGGCCGGCC ACACCCCGGG GCACACCTCG TTCGTGCTGT CGTCCGGTTC GGACAAGGTG TTCATTCAGT CCGACATCAC CAATTTGCCG GCGCTGTTCG TCGCCAATCC CGGCTGGCAC CTGATGTTCG ACCAGGACCC GGCGATGGCC GAGACCACCC GCCGCAAGGT CTACGACATG CTGGTCGCCG ACAAGATGCG GGTGCAGGGC TTCCACTATC CGTTCCCCGC CAACGGCTAC GTCCAGAAGG ACGGCAGCGG CTATCGCCTG GTGCCGGCGC CGTGGAGCCC GGTGATCTGA
|
Protein sequence | MDMTRRHALT AAAALAAAPL LRSAPAEAAA PLADKQAPSF YRYKVGDAQV NVISDGVNSF PLGDSFVLNA KKDEVNKALE AAFLPKDRIS INFAPLVINT GGKLVVVDTG NGPAAFASSK GNVGQFAGNM AAAGLDPKAV DIVVISHFHG DHINGLLGAD NQPAFPNAEV LVPAAEWKYF MDDGEMSRAS GERMQGVFKN ARRVFEAGLN KKVTPYEWGK DVAPGLLAVE SAGHTPGHTS FVLSSGSDKV FIQSDITNLP ALFVANPGWH LMFDQDPAMA ETTRRKVYDM LVADKMRVQG FHYPFPANGY VQKDGSGYRL VPAPWSPVI
|
| |