Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4721 |
Symbol | |
ID | 6412407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5081176 |
End bp | 5082735 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642714600 |
Product | hypothetical protein |
Protein accession | YP_001993687 |
Protein GI | 192293082 |
COG category | [S] Function unknown |
COG ID | [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.358451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCCC CAGACGCCAG CTCCCCCGGC TCCCAGCAGC AAGACGTCTT CGACTTCCTG GGCCGCGGGG CAGGCGACGC GCCCGTGGTG CAGATCGACA CCCATGGCGC AGCGGTTTTT CTCGAGGGGA ACCGGGCGCT GAAGATCAAG CGCGCCGTCA AGTTCCCGTT TCTGGACTAT TCAACCCTCG CCAAACGAAA GATCGCCTGC GAGCAGGAGC TCGAGGTCGG CCACCGATTC GCGCCGACGA TCTATCGGCG TGTCGTTCCG ATCACCCGCA CTGACAAGGA AGCGTTACAG ATCGGCGGCG AAGGACCAGC CGTGGAATGG GCCGTCGAGA TGATGCGCTT CGACGACAGC GCCACGCTGG ATCATCTCGC CCGCGCCGGT TCGCTCGGTC CTGAGCTGAT CGACGCCGTC GCCGATGCGA TCGCCGCCTC GCATCAGGCA GCGCCTTTGG CGGCGACTGC ACCATGGGTC GCATCGATCG AACCGATCCT GGCCGACGAC ACCAACGAGC TTGCCGCAGG CGGTTTTGCC GCCGCCGACG TCGCGGCGCT CGACAACGGC AGCCGCAATG CGCTCGGCCG GCTGCGCCCG TTGCTGGAGC AGCGCGGTGT GGCCGGCTTC GTTCGCTGGT GCCACGGCGA TCTGCACCTC GCCAATATCG TGGTGATCGA CGGCAAGCCG ACGCTGTTCG ACGCCATCGA ATTCGATCCG GCGCTCGCCT CGGTCGACGT GCTGTACGAT CTCGCCTTCC CGCTGATGGA CCTGCTGCAT TACGGCCGCG GCAGCGACTC CGCACAACTT TTGAACCGCT ATCTCGCGGT GACGAACGCG GACAATCTCG ATGCGCTGTC GACGCTGCCG TTGCTGCTGT CGATGCGCGC TGCGATCCGC GCCAAGGTGA TGCTGGCACG ACCCGCGGCC GATGAGACGA TCAGGCGAGC CAATCGGGCG ATTGCCGAAT CCTATTTCGA GCTGGCACTG CGGCTGATCG CGCCGCCCCG GCCCCGGCTG ATCGCGGTCG GCGGGCTGTC GGGCACCGGC AAGTCAGTGC TGGCTCGCGC TCTCTCCAGC AACGTCCCGC CCCTGCCCGG TGCCGTGGTG CTGCGCTCGG ATGTGGCCCG CAAACGGCTG CACGGCGTCG CCGACACTGA ACGGCTCCCG GCAACAGCCT ACACCACTGA AGTGACGGAG GCGGTGTATC GCGGTCTGGC TGAGCGCGCC GCGCATATCT TGAAACAGGG ACATTCGGTG ATCGTCGATG CGGTGTTCTC CAAGCCCGAG GAGCGCGACG CGATCGAAAG CGTCGCGGCC GGGCTTGGCA TCCCATTCCA CGGGCTGTTT CTCACCGCCG ATCTCGCCAC GCGGGTCGCG CGAGTCGCAG GCCGTACCGC AGATGCGTCC GATGCGACGC CGGAGATCGT CCGGCAGCAG CAAAGCTACG CGCAAGGCGT GATCGGCTGG ACCTCGATCG ACGCCGGCGG CACTCCGGCC GAGACGCTGT CGCGGGCGGT GGCGGCGTTG CCGCAGACCG CTCAGGTCTG CAGCACGTAG
|
Protein sequence | MPAPDASSPG SQQQDVFDFL GRGAGDAPVV QIDTHGAAVF LEGNRALKIK RAVKFPFLDY STLAKRKIAC EQELEVGHRF APTIYRRVVP ITRTDKEALQ IGGEGPAVEW AVEMMRFDDS ATLDHLARAG SLGPELIDAV ADAIAASHQA APLAATAPWV ASIEPILADD TNELAAGGFA AADVAALDNG SRNALGRLRP LLEQRGVAGF VRWCHGDLHL ANIVVIDGKP TLFDAIEFDP ALASVDVLYD LAFPLMDLLH YGRGSDSAQL LNRYLAVTNA DNLDALSTLP LLLSMRAAIR AKVMLARPAA DETIRRANRA IAESYFELAL RLIAPPRPRL IAVGGLSGTG KSVLARALSS NVPPLPGAVV LRSDVARKRL HGVADTERLP ATAYTTEVTE AVYRGLAERA AHILKQGHSV IVDAVFSKPE ERDAIESVAA GLGIPFHGLF LTADLATRVA RVAGRTADAS DATPEIVRQQ QSYAQGVIGW TSIDAGGTPA ETLSRAVAAL PQTAQVCST
|
| |