Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2177 |
Symbol | |
ID | 3909957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2470867 |
End bp | 2472105 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637884071 |
Product | hypothetical protein |
Protein accession | YP_485794 |
Protein GI | 86749298 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0642236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0211991 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA TGACGATGAC GACGAAGAAG ATGACAGCCC TGGTTCGGAT CGGTGTCGTG TTGGCGGGCG CAGCGCTCGC CGCGTCTCCC GTCGCGGCGC AACAGCCGTC GCGCGAACCT GTGCAGGTCG CCTCCGGCGT CTCCTATCAA TATCTCGATC GCTGGGACGT CGATCGCCTC AACAAGATCC TGACCACCGA CACGCCGAAA TTCGCCGGCA TCAAGGTGAC CTACACGCCC GCCACCAACG CGGTGAAACT GTATCGCGTG ACCTATTCGT CGGTAGTGCC CGAGCGCGGC AACAAGCCGA TCGTCGCCAC CGGACTGATC GCGGTGCCCG ATACCAAGAC GACGTCGTTT CCGATGGTCT CCTATCAGCA CGGCACGGTG TACGGAAAGC AGGAGGTGCC GTCGTTCCCG GAGCAGTCGC CCGAGACCCA GCTGATGATC GCGCAATTCG CCGGGCAGGG CTATCTCGTG ATCGGCGCCG ACTACTTCGG CATGGGCAGC TCGACCGAGC CGGAAGGCTA CATGGTCAAG GGCAGCCATC AGCAGGCGAC CTACGACATG GTCGTCGCCA GCCGCGCCGT GCTCGCCGAT CTCAAGCTCG CCGACACCAA GCTGTTCCTC AGCGGCTGGT CGCAGGGCGG CTTCGTCACC CTGGCGATGC TGGAGAAGCT TGAAGCCGCC GGCATGAAGG TGAATGCGGC GGCCACCGCC AGCGCGCCGG CCGACGCTTT CGCCGCGTTC GAGGGCTTCC TCGACTTTCC GCGCAAGATC GACGCGAGCT GGATTCCGAC GATCTTCATC CTGACCGCGT TCTCGTTCGA GAACTACTAC GGCGTGCCGG GTCTCGCCCG CTCGCTGCTG ACCGAGGACA CGTATGAGAA CGCGCGCAAG GCCTATCTGC GCGAGCCGTT CGACCAGGCC GCGATCCCGA CCGATCTGAA GAAGCTGATC CGGCCGGAAT ATTTCGACAC GCACTACTTC GCCAATTCGG CCTATGGCCG CATCGTCGCG CAGACGCAGT CCTATCGCTG GGTGGTGAAG ACCCCGGTGC GCACCTATTA CGGCCTGACC GACGAGGCGA TCCGCACCGA GCTCGGGCAA CTGCCGATGA CCTGGCAGAA GGCGATGGGC AACGACAAGA TCGAGGCGGT GCCGACCGGC GACACCAGCC ATCGCGGCAC CTTCGCCACC GCCGTGCCGC AATGGAAGGC GTGGTTCGAC GGCAAGTAA
|
Protein sequence | MTTMTMTTKK MTALVRIGVV LAGAALAASP VAAQQPSREP VQVASGVSYQ YLDRWDVDRL NKILTTDTPK FAGIKVTYTP ATNAVKLYRV TYSSVVPERG NKPIVATGLI AVPDTKTTSF PMVSYQHGTV YGKQEVPSFP EQSPETQLMI AQFAGQGYLV IGADYFGMGS STEPEGYMVK GSHQQATYDM VVASRAVLAD LKLADTKLFL SGWSQGGFVT LAMLEKLEAA GMKVNAAATA SAPADAFAAF EGFLDFPRKI DASWIPTIFI LTAFSFENYY GVPGLARSLL TEDTYENARK AYLREPFDQA AIPTDLKKLI RPEYFDTHYF ANSAYGRIVA QTQSYRWVVK TPVRTYYGLT DEAIRTELGQ LPMTWQKAMG NDKIEAVPTG DTSHRGTFAT AVPQWKAWFD GK
|
| |