Gene Information |
Locus tag | Rpal_2389 |
Symbol | |
ID | 6410051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 2573593 |
End bp | 2575242 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642712268 |
Product | protein of unknown function DUF894 DitE |
Protein accession | YP_001991378 |
Protein GI | 192290773 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
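The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a protein length that excludes the terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
# Minimal sketch: cross-check the length fields of the record.
# Assumptions (not stated in the record): Start bp and End bp are
# 1-based and inclusive, and Protein Length excludes the stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2573593, 2575242)
    print(length)                  # 1650
    print(protein_length(length))  # 549
```

Both values agree with the Gene Length (1650 bp) and Protein Length (549 aa) rows.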
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGTA TTGAAGGCGA CCAGCAGACG GCGGCTCCGC CACGCCCCTC GTCCTGGGCA GCCTTTCACC ACATTGCCTT CACGGTGGTG TGGACGGCCA CCGTGGTGTC CAACGTCGGC ACCTGGATGT TCAATGCTGC CTCCGGCTGG CTGATGACCA GCCTTGAGGC CGACCCGCTG CAGGTCGCGC TGGTACAGGT CGCGTCCAAC CTGCCGATCT TCCTGTTCGC CATTCCGGCT GGCGCGCTCG CCGATATCGT CGACAAGCGT AAGTTTCTGA TCGTCGTGCA GATCGTCCTG ACCTTGTTTG CGAGCGCCAG CGCCGCGCTG GTCTGGTTCG GCCTGATGAC GCCGCCGCAG CTGCTGCTGT TCACCTTCCT GCTCGGCGCC GGCGCCGCCT TTGCAGCCCC GGCATTTCAA TCGATCGTGC CGGATCTGGT CCCGAAGCAG GACCTCGCCT CCGCGGTCGC CAGCAACGGC GTCGGCATCA ACATCAGCCG GGCAATCGGT CCGGCGCTGG GCGGCGTCGC GATCGGCAGC CTTGGCATCG CCGCGCCATT TTGGATCAAC GCGGTCAGCA ACCTTGCGGT GATTGCTGCG CTGATCTGGT GGCGGCCGGC CAATTCGCGT AGCGGCAGCC TGCCGCCGGA GCGACTGATC GAAGCGCTGG TCACCGGCAT CCGTCACGCC CGGCACAACA CCGAGTTGCG CGCGACGCTG GTCCGAGCCG TGGCGTTCTT CTTCTTTGCC AGCGCCTATT GGGCGCTGCT GCCGCTGGTT GCGCGATCGC GTATCGCCGG CGGCCCCGAG CTTTACGGCA TCCTGCTCGG CGCCGTCGGT GCCGGTGCGA TCGGTGGGGC GCTCGTGTTG CCGAAGCTGA AGGCCACGCT CGGCCCGGAC CGGCTGGTTG CCGCCGGCAC GCTCGGCACC GCGCTGTGCC TGGTGATGCT CGGCCTTGCC CGACAGGTTG AACTCGCCGT GGCCGCGTGT CTGCTCGCCG GCATCTCGTG GATCGCTGTG CTGGCGAGTT TGAACGTTTC GGTGCAGGTC GTGCTGCCGG ATTGGGTACG CGGACGCGGG CTGGCGATGT TCGTCACCGT GTTCTTTGGC GCGATGACCG CAGGCAGTGC GCTGTGGGGC CAGATCGCCT CGTCGTTCGG TCTGCCGATC GCGCATTTCG CCGCCGCGGC CGGCGCGGTG CTCGGCATCG CGGCGACGTG GCGTTGGAAA CTGAAAGCAG GCGTAGACGT CGACCTGTCA CCGTCGATGC ACTGGCCGAC ACCGATCGTC AGCATCGATG CCGAGCCCGA TCAGGGGCCA GTGCTGATCA CGGTCGAGTA CCGGATCGCA AAGGGCCATC GCGACGCGTT TCTCTCCGCC GTCCGGCATC TCAGCCGGCA GCGGCGGCGC GACGGCGCCT ATCAATGGGG CGTATTCGAG GATGCGGCCG ATCCTGGCCG CTTTGTCGAG ACCTTCAAGG TCGCCTCATG GCTGGAGCAT CTGCGCCAGC ACGAACGCGT CACCAACGCC GATCGGGTGC TGCAGGAGCA GATCCGCTTG TGCGACGCCG AACCGAAAGT GACGCACCTG ATCGCCGCGC CACATGGGCA CGTCAAACGG CAGCCGACGC ATGCGTCGCA AACGCCGTGA
|
Protein sequence | MSSIEGDQQT AAPPRPSSWA AFHHIAFTVV WTATVVSNVG TWMFNAASGW LMTSLEADPL QVALVQVASN LPIFLFAIPA GALADIVDKR KFLIVVQIVL TLFASASAAL VWFGLMTPPQ LLLFTFLLGA GAAFAAPAFQ SIVPDLVPKQ DLASAVASNG VGINISRAIG PALGGVAIGS LGIAAPFWIN AVSNLAVIAA LIWWRPANSR SGSLPPERLI EALVTGIRHA RHNTELRATL VRAVAFFFFA SAYWALLPLV ARSRIAGGPE LYGILLGAVG AGAIGGALVL PKLKATLGPD RLVAAGTLGT ALCLVMLGLA RQVELAVAAC LLAGISWIAV LASLNVSVQV VLPDWVRGRG LAMFVTVFFG AMTAGSALWG QIASSFGLPI AHFAAAAGAV LGIAATWRWK LKAGVDVDLS PSMHWPTPIV SIDAEPDQGP VLITVEYRIA KGHRDAFLSA VRHLSRQRRR DGAYQWGVFE DAADPGRFVE TFKVASWLEH LRQHERVTNA DRVLQEQIRL CDAEPKVTHL IAAPHGHVKR QPTHASQTP
|
| |
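The gene and protein sequences can be related with a short translation sketch. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TGA, even though the gene lies on the minus strand of the replicon) and uses the amino acid assignments of translation table 11, which match the standard genetic code. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
# Minimal sketch: GC content and translation of a CDS as listed in the record.
# Assumption: the sequence is already the coding strand, so no reverse
# complement is applied. Spaces between the 10-base blocks are ignored.

BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 shares these assignments
# with the standard code ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in the record.
    head = "ATGAGCAGTA TTGAAGGCGA CCAGCAGACG"
    print(translate(head))  # MSSIEGDQQT
```

The ten residues returned for the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content row.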