Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2224 |
Symbol | |
ID | 6409884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2412346 |
End bp | 2413503 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642712108 |
Product | hypothetical protein |
Protein accession | YP_001991220 |
Protein GI | 192290615 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACA AGGTCTTCGC CGATCTGCCC GTCACCGTGT TCGAAGCGAT GTCGCAGCTC GCGCGCGACA ACGACGCGAT CAATCTCGGC CAGGGCTTTC CGGACGATCC GGGGCCGGAG GACATCCGTC GCGCCGCCGC CGACGCGGTG CTGAACGGCT ACAACCAGTA TCCGTCGATG ATGGGGCTGC CGGAGCTGCG TCAGGCGATC TCGACGCACT ACGCGCATTG GCATGGCGTC CAGCTCGATC CGATGACCGA AGTGATGGTC ACCTCCGGCG CCACGGAAGC GCTGGCGAGC GCCATCCTGT CGGTGGTCGA GCCCGGCGAC GAAGTGATCG TCTTTCAGCC GGTGTACGAC TCCTATCTGC CGATCATCCG GCAGGCCGGC GGCATTCCGC GCCTGGTCCG GCTCGAGCCG CCGCATTGGC GGATCACCGA GGAATCTCTG CGGCGGGTGT TCAACGCCAA GACCAAGGCG ATCGTCTTCA ACAATCCGCT CAACCCGGCT GCGGTCGTCT ATCCCCGCGA GGATCTCGAA TTGTTGGCGC GCTTCTGCCA GGAGTTCGAC ACGGTGGCGA TCTGCGACGA GGTGTGGGAG CACGTCACCT TCGACGGTCT CACCCACATC CCGCTGATCG CGATTCCGGG GATGCGCGAT CGCACCATCA AGATCGGCTC CGCCGGCAAG ATCTTCTCGC TCACCGGCTG GAAGGTCGGC TTCGTCTGCG CCGCGCCAAG GCTGCTGCGG GTCGCCGCCA AGGTTCACCA GTTCCTGGCG TTCACCACCG CGCCGAACCT GCAGGTCGCG GTCGCCTACG GGCTCGGCAA GTGCGACGAT TACTTCCTGC AGATGCGCAA GGATCTCGCC CGCAGCCGCG ATCGGCTGGC GCAGGGGCTG TCCAGCATCG GCTTTCCGGT GATCCGCTCG CAGGGCACCT ATTTCCTCAC CGTCGATCTG TCGCCGCTCG GTCTCAACGA GACCGACGAG GCGTTCTGCA AGCGGATCGT CACCGACTAC AAGGTCGCGG CGATTCCGGT ATCGGCGTTC TACGAGGAAG AGCCGGTCAC ATCCGTGGTG CGGTTCTGTT TCGCCAAAAA GGATCAGACG CTCGACACTG CCCTCGAGCG CCTGTCGGAT GCGGTTCACG GGCGATAG
|
Protein sequence | MSNKVFADLP VTVFEAMSQL ARDNDAINLG QGFPDDPGPE DIRRAAADAV LNGYNQYPSM MGLPELRQAI STHYAHWHGV QLDPMTEVMV TSGATEALAS AILSVVEPGD EVIVFQPVYD SYLPIIRQAG GIPRLVRLEP PHWRITEESL RRVFNAKTKA IVFNNPLNPA AVVYPREDLE LLARFCQEFD TVAICDEVWE HVTFDGLTHI PLIAIPGMRD RTIKIGSAGK IFSLTGWKVG FVCAAPRLLR VAAKVHQFLA FTTAPNLQVA VAYGLGKCDD YFLQMRKDLA RSRDRLAQGL SSIGFPVIRS QGTYFLTVDL SPLGLNETDE AFCKRIVTDY KVAAIPVSAF YEEEPVTSVV RFCFAKKDQT LDTALERLSD AVHGR
|
| |