Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4008 |
Symbol | |
ID | 6411690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4294825 |
End bp | 4295832 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642713890 |
Product | hypothetical protein |
Protein accession | YP_001992979 |
Protein GI | 192292374 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.829805 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTGT TTGACGGCGT GAGCGCAGCG GGGCGCGCTG CGATCGCCCT GATGGCACTA TGGTTGTTGC CTGCCTCGGC TGCCGAGCCG GACCGGTTGG ATCAGATCGA TTTTCCCGTT CGTACCGTCA CTGTTGTCGT TCCGTTCGCC AAGGGCGGAC CAACCGATAC CGTCGCCCGC CTGATTACCG CCGAGATGGC CAAGACGCTC GGCCAGCCGA TCGAAATCGA GAACATGCTC GGCGCCGGAG GTACACTGGC GGCGACCCGC GTGGCGCATG CGGCCCCGGA CGGCCACACC CTGATCGTCG GACACCTCGG AACGCACGGC GCCGCGGTAG CGCTGTTCCC CAAGCTCGCC TATCGGCCCG ACAAGGACTT CACGCCGGTC GCTCTGCTCA CCGAGATGCC GGTACTGCTG CTCGCCCGCA AGCAGTTCCC ACCGAAGGAC CTGAGCGAAT TCGCGTCCTA TGTGAAATCG CACACCGACA ACCTCAACGT CGCGCACGCC GGTTTCGGCT CGGTTTCGTA TGCGTCGTGC CTGCTGCTCA ACCGCCTGCT CAAAATCGAT CCGACCGGAG TGCCGTTCAG TGGCACCGGC CCGGCGCTGC AGGCGCTGGT CGAGGGGCAG GTCGACTACA TGTGCGACCA GATCGTCAAC GCGGTGCCGG CGCTCCGCGA GGGCAAAGTC AAAGCCTATG TGATTGCCGC GTCCGAGCGC GATCCCGTCG TTCCTGACGT GCCCACCGCG CGCGAAGCCG GCCTGCCGGG GTTCCAGGTC GGCGCCTGGA CCGGGCTGTT TGCACCGCGC GGCACTCCGG AACCGATCGT GGCCAAGCTC AATGCCGCCG TCTCCCGCGC CCTCGATCAG TCGGATGTTC GGACCCGGCT GACCGACCTC GGCGCCCTGG TGCCGCGGCC GGAACAGCGC GCTCCGGTGG TGCTGGCGCA GCTCGTCCAG GAAGAGATCT CACGCTGGGA AGACGTGGTG AAGGGCACAA CGCCCTAG
|
Protein sequence | MNLFDGVSAA GRAAIALMAL WLLPASAAEP DRLDQIDFPV RTVTVVVPFA KGGPTDTVAR LITAEMAKTL GQPIEIENML GAGGTLAATR VAHAAPDGHT LIVGHLGTHG AAVALFPKLA YRPDKDFTPV ALLTEMPVLL LARKQFPPKD LSEFASYVKS HTDNLNVAHA GFGSVSYASC LLLNRLLKID PTGVPFSGTG PALQALVEGQ VDYMCDQIVN AVPALREGKV KAYVIAASER DPVVPDVPTA REAGLPGFQV GAWTGLFAPR GTPEPIVAKL NAAVSRALDQ SDVRTRLTDL GALVPRPEQR APVVLAQLVQ EEISRWEDVV KGTTP
|
| |