Gene Information |
Locus tag | Rpal_4593 |
Symbol | |
ID | 6412277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4951807 |
End bp | 4954905 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642714473 |
Product | glycosyl transferase family 2 |
Protein accession | YP_001993562 |
Protein GI | 192292957 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGCTC CGACATTCAG TATCGTGGTG CCGACCTACA ATCAGGCGCA GTATCTTGGA GCGTGTCTCG ACAGCATCGC CAGCCAGACC GACGGCGACT GGGAGGCAAT CGTGGTGGAT GACGGCTCCA CCGACGGCAC CGCGGCGCTG GCCGACGACT ACGCGGCGCG CGATCCGCGG TTCAGGGTCA TCCATCAACC CAATGGAGGC GTCGCCAGCG CGCTGAACGA AGGCCTGCGC CAGGCACGTG GACAGTGGAT TCACTGGCTG TCGTCGGACG ATTTGTTCGA TCCGCGCAAA CTCGAGATCA ACCGCGAGCA GATCCGACAG CATCCGGACT GCAAGTTCTT CTTCTCGTTC TTCCGGCTAC TCCGGGAATC GACCCAGGAA TTGACCGATC ACGGGCTGTG GGGACCGTTG CCGGATCGCG AGTCCCAGAT CCCTACCTTG TTTTTCCGGA ACTACATCAG CGGCATCACG ATCTGCGTCG AGCGGACGGC CTGGCAATCG GTCGGATCGT TCGACCCGTC GCTGCGCTAT GCGCAGGACT ACGACATGTG GCTGCGGCTG CTGGCCAGGT TTCCGGCCCG CTTCATCGAC CAATGGACCG TCACCAATCG CAACCATGCC TTGCAGGGCT CAGAAGTCTT TCCGCAGGCC TGCTACTATG ACACCGCCAA AGCCGCGATC CAGTTCCTCA ATCAGCATCG TTTCGAGGCG CTGTTCCCGC TCATGGATCT CACCGATCCG GCGCAGGCGG TTCGCGCGCT CGATCACGCC CTGACGATCA CTTCGGATCC CTCGGCGTTT CTGTACGCCC TGGGCTGGCA TCCGGCGCTG CTCTGGCGCG CCCTGGAATG GATCGCGGCC ACCGAACGCC GCGATCCGGT GCTGGGCCGG CGGCTCCGAA CCAGAGCCCG CTGGATCTGC CGACGGAACG CCCGCGCCTG GAGCGACCGC AATAGCGCGT CGATCTGGAC CTCGATCGGC GCCGCGCTCG CGATGGAGGG CCTCACCACC GTCTACCGGA GCCTCGACCC CGTCGATATC GCGGTCGACC GCTATTTCAC ACTGCGAGCG GCCGGGGATC CCGCGTCCGC CGCATTGGCG GCCTATCTGA AACAGTTCCA CGGGCTGTCG CTGCCGGAAC CGGCGTCCAC CGGCGATCAG GGCGGTACGC TCGCTGTCGT CCTCGACCAC GCAACGGGCG AAGCCGAGGC CCTGGCCCAG GCGCGGCCTG TGGTCCGGGC GATGGCCGCC CGTGGGTGGC GGACCGTTCT GTTCACGGCG GGTGCACCAT CGTTTCAGCT CGACTTCACC ACGCCGGTGA TCTCGGGTCC TCGGAACAAG ATTGTCGGGC TGGCCGCACA ATTCCAGCCC CGATGCGTTC TGTCCGCCGA CCCGACCAAC GCGGCATTTC CCCATGGCTT CCCAGTGCTG CATGTGTCAG CAGCCGCTGA CAGCGCGGTG CTCGCCGGTT TCGTGACGGA TTTGAGCTGC GAAGCCTCGA CGCAGCCGCC ACGGCAGGAG GCCCCCCCCA CAAGCCGGAT TCCGTTGGTT TTTCTCACGC GAGCGCTGCA TGGCGGCGGC GCCGAACGGG CGCTTCAGAG CGTCGCCAGT GCGTTGAACA GTCAGCTGTT CGACGTCCAC ATCGTGCCGT TGTTCAACTC CGCGATCTCA CCCGAATTCG GTCACGCGAC AGTAGCGCCT TCGTTCGAGG CGGTATGGTT CGAACGGACC AAGACCGAGG TTGTTCAACA GCAGGTACCT TTGCCGCAGA CTCCTCCAAC TGCGGAGACC 
ATGGCCGCTC CGCGGCCGCG CTTCCCGGCC TGGGTCACGT TCATTGCCAG CAAGCTGACG TCGGCGCAAA AGCAGCGAAT TCGGGCGAGC ATTGCGTTCA AGATCGCCCG GCTGGGCTGG CGCGCGTTCA AGGCGGCTCG CCGCGAACTG ACTGCTGACG CCCCGCCCCC CTGCGTCGAC GTCGCTCTAC CCCCGCTGTC GGAGGAATTG TCGCCCGCTG CCGCGCGTCC CGGGATCAAG GCAACGGCGC TCACCCCGTA CGACAACATG GCCGACTACA TCGCGCAGAT CCTAGCGGGG CTAGGCCATA GCGCTATCGT GATTTCGCTT ATGGAAGAGG CGACGATCGT GGCGTGGCTC GCCTCATTGC GCACGCCGAT GCGTTATCTG GCATGGCTCC ACACCGTCGA AAGTCTCTAC CTCGACCAGA TGTTCCCCGC TCCCGCCCAA CGAGCCAAGT TCGACATCCT GTTGCATGCC GCGGTGGCCC GTTCGGAGCG ATGCGTCTTC CCGTCGCGCG GCTGTTGCGA CGATCTGACC GAACTGTACG AACTGCCGCC GGACCATTTC CAGTGCATCT ACAATCCGAT CGATCTCGCC ACAGTACGGC GGCTGAGTGA GCTGCCATTT GAAAGGCCGC TCGCTCCACA CCCCAACGTC CCGATCCTGG TCAGCCTCGG CCGGCTATCG CCTGAGAAGG ATCATGCACA TCTACTGAAG GCACTGAGCC TCCTGCGGCA GCGGGGTCGG GATTTCCTTT GCCTGATCAT CGGCGATGGC GACCATGTGG GCGAGATCAC ACGATTGATC GAGCACCATG CGCTCGCCGA CCAGGTCAGA CTTTTAGGGG CCGTCCAGAA CCCATTTCCG TATCTGGCGG CCGCCGACGC GTTGATCCTG ACGTCGAAAT TCGAGTCCTT CGCGCTCGTC CTTGTCGAGG CGATGGCCTT GGAGGCGGTG CCGGTGGCGG TCGACTGCCC GACCGGCCCG CGCGAAGTGC TGGATTGCGG ACAGGCGGGC GTTCTCGTCC CTCCCGGCGA CGAACGCGCG CTGGCGGATG CGATCGAACA CATCGTGTGG TCAGAAGCCG ATCATTCCGC GCTGCTGGCC ACCATGGCCG ACAGGTTGAA ATCATTTGAC ATCGCGACCG TGGCTTTGCA ATGGGAACGC CTTGTCGGGA AGGTCCATGC CAAGGCAACT CTTACCGATG AAGCATGCCC TGATTCGACG CGCCCCAACT TCGAAAGCGC AGCCGATGTT CAGGATAGTG CGAGATACTC CGCCGGATTG CACCGCTGA
|
Protein sequence | MPAPTFSIVV PTYNQAQYLG ACLDSIASQT DGDWEAIVVD DGSTDGTAAL ADDYAARDPR FRVIHQPNGG VASALNEGLR QARGQWIHWL SSDDLFDPRK LEINREQIRQ HPDCKFFFSF FRLLRESTQE LTDHGLWGPL PDRESQIPTL FFRNYISGIT ICVERTAWQS VGSFDPSLRY AQDYDMWLRL LARFPARFID QWTVTNRNHA LQGSEVFPQA CYYDTAKAAI QFLNQHRFEA LFPLMDLTDP AQAVRALDHA LTITSDPSAF LYALGWHPAL LWRALEWIAA TERRDPVLGR RLRTRARWIC RRNARAWSDR NSASIWTSIG AALAMEGLTT VYRSLDPVDI AVDRYFTLRA AGDPASAALA AYLKQFHGLS LPEPASTGDQ GGTLAVVLDH ATGEAEALAQ ARPVVRAMAA RGWRTVLFTA GAPSFQLDFT TPVISGPRNK IVGLAAQFQP RCVLSADPTN AAFPHGFPVL HVSAAADSAV LAGFVTDLSC EASTQPPRQE APPTSRIPLV FLTRALHGGG AERALQSVAS ALNSQLFDVH IVPLFNSAIS PEFGHATVAP SFEAVWFERT KTEVVQQQVP LPQTPPTAET MAAPRPRFPA WVTFIASKLT SAQKQRIRAS IAFKIARLGW RAFKAARREL TADAPPPCVD VALPPLSEEL SPAAARPGIK ATALTPYDNM ADYIAQILAG LGHSAIVISL MEEATIVAWL ASLRTPMRYL AWLHTVESLY LDQMFPAPAQ RAKFDILLHA AVARSERCVF PSRGCCDDLT ELYELPPDHF QCIYNPIDLA TVRRLSELPF ERPLAPHPNV PILVSLGRLS PEKDHAHLLK ALSLLRQRGR DFLCLIIGDG DHVGEITRLI EHHALADQVR LLGAVQNPFP YLAAADALIL TSKFESFALV LVEAMALEAV PVAVDCPTGP REVLDCGQAG VLVPPGDERA LADAIEHIVW SEADHSALLA TMADRLKSFD IATVALQWER LVGKVHAKAT LTDEACPDST RPNFESAADV QDSARYSAGL HR
|
| |
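The derived fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: all function names are hypothetical, and it assumes the coordinates are 1-based and inclusive. It checks that the gene length follows from the start and end coordinates, that the protein length follows from the gene length (one codon is the stop), and that the first and last codons of the gene sequence translate to the start and end of the protein sequence under translation table 11. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons; this gene starts with ATG, so alternative starts are not handled here.

```python
# Consistency checks for the gene record above.
# All names here are illustrative and not part of any database API.

BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks between 10-character blocks."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


length_bp = gene_length(4951807, 4954905)
print(length_bp)                # 3099
print(length_bp // 3 - 1)       # 1032 (stop codon is not translated)
print(translate("ATGCCGGCTC CGACATTCAG TATCGTGGTG"))  # MPAPTFSIVV
print(translate("CACCGCTGA"))   # HR*
```

Passing the full gene sequence from the record to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and the complete protein sequence.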