Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5031 |
Symbol | |
ID | 6412725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5412344 |
End bp | 5413633 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642714916 |
Product | hypothetical protein |
Protein accession | YP_001993995 |
Protein GI | 192293390 |
COG category | [S] Function unknown |
COG ID | [COG4487] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATC TGACCATCAC CTGCCCGAAC TGCGCCTCGT CGGTGCCACT GACGGAGTCC CTGGCGGCGC CGCTACTGAA GGATACGCAG GCCAAATACG AGCGGCTGAT CAAGCAGAAG GATCAGGACA TCGCCGGGCG CGAGCAGGCG CTGCGGGCGC AGCAGGCGGA GGTCGAGAAG GCCAAGGCGG CGGTGGCACA GCAGGTCGCC GACCAGGTGA CGGCGGCGCG GGCGCGGATC GCCGCGGAGG AAGCCGCCAA GGCAAAGCGC CTCGCCGAAA ACGATCTCGC CGACAAGGCG CGGCAGCTCG CCGAGCTACA GGAGGTGCTG AAAAGCCGAG ACGCCAAGCT CGCCGAGGCG CAGCAGGCGC AGGCGGAGTT TGTGAAAAAG CAGCGGCTGC TCGAGGACGA GAAGCGCGAG CTTCATCTGA CGATCGAGAA GCAGGTCCAA GCCGGCCTCG ATGAAGCGCG GCAGAAGGCC CAGCAGGCCG CCGAAGATAA TCTGCGGCTC AAGGTCACCG AGAAAGAAGA GCAGATCGCC GCGATGCAGC GGCAGATCGA GGATCTGAAG CGCAAGGCCG AGCAGGGCTC GCAGCAATTG CAGGGCGAGG TGCTGGAGCT CGAACTCGAA GCCTCGCTGC GCGCCAAGTT TCCGCACGAC CAGATCGAGC CGGTGCCGAA GGGCGAATTC GGCGGCGACG TGCTGCAGCG GGTGGTGAGC GCCGCGGCGC AGCCGTGCGG CAGCATCCTG TGGGAATTCA AGCGCACCAA GAATTGGTCG GACGGCTGGC TGACCAAGCT GCGCGACGAC CAGCGCAAGG CCAAGGCCGA GCTGGCCCTG ATCGTCTCCA ACGCGTTGCC GAAGGGCGTG CACACCTTCG ACCATATCGA CGGCGTCTGG GTCACCGAAG CGCGCTGCGC GATTCCGGTG GCAATCGCGC TGCGGCAGTC GCTGATCGAG CTCGCCGCCG CGCGCCAGGC CGGCGTCGGC CAGCAGACCA AGATGGAGCT GACCTACCAG TACCTCACCG GTCCCGCATT CCGGCAGCGG ATCGAGGCGA TCGTCGAGAA GTTCACCGAG ATGCAGAGCG ATCTCGACAA GGAGCGTCGC TCGATGATGC GGATGTGGGC CAAGCGCGAG GCGCAGATCC GCGGCGTGCT CGAGGCCACC GCCGGGATGT ACGGCGATCT GCAGGGCATC GCCGGCAAAG CGCTGGCCGA GATCGACGGC ATGGCGCTGC CGATGCTGGA AGACTTCAGC GACGACGACG GCGACAGCGA AGCGGCGTAA
|
Protein sequence | MTDLTITCPN CASSVPLTES LAAPLLKDTQ AKYERLIKQK DQDIAGREQA LRAQQAEVEK AKAAVAQQVA DQVTAARARI AAEEAAKAKR LAENDLADKA RQLAELQEVL KSRDAKLAEA QQAQAEFVKK QRLLEDEKRE LHLTIEKQVQ AGLDEARQKA QQAAEDNLRL KVTEKEEQIA AMQRQIEDLK RKAEQGSQQL QGEVLELELE ASLRAKFPHD QIEPVPKGEF GGDVLQRVVS AAAQPCGSIL WEFKRTKNWS DGWLTKLRDD QRKAKAELAL IVSNALPKGV HTFDHIDGVW VTEARCAIPV AIALRQSLIE LAAARQAGVG QQTKMELTYQ YLTGPAFRQR IEAIVEKFTE MQSDLDKERR SMMRMWAKRE AQIRGVLEAT AGMYGDLQGI AGKALAEIDG MALPMLEDFS DDDGDSEAA
|
| |