Gene Rpal_4593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4593 
Symbol 
ID6412277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4951807 
End bp4954905 
Gene Length3099 bp 
Protein Length1032 aa 
Translation table11 
GC content64% 
IMG OID642714473 
Productglycosyl transferase family 2 
Protein accessionYP_001993562 
Protein GI192292957 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGCTC CGACATTCAG TATCGTGGTG CCGACCTACA ATCAGGCGCA GTATCTTGGA 
GCGTGTCTCG ACAGCATCGC CAGCCAGACC GACGGCGACT GGGAGGCAAT CGTGGTGGAT
GACGGCTCCA CCGACGGCAC CGCGGCGCTG GCCGACGACT ACGCGGCGCG CGATCCGCGG
TTCAGGGTCA TCCATCAACC CAATGGAGGC GTCGCCAGCG CGCTGAACGA AGGCCTGCGC
CAGGCACGTG GACAGTGGAT TCACTGGCTG TCGTCGGACG ATTTGTTCGA TCCGCGCAAA
CTCGAGATCA ACCGCGAGCA GATCCGACAG CATCCGGACT GCAAGTTCTT CTTCTCGTTC
TTCCGGCTAC TCCGGGAATC GACCCAGGAA TTGACCGATC ACGGGCTGTG GGGACCGTTG
CCGGATCGCG AGTCCCAGAT CCCTACCTTG TTTTTCCGGA ACTACATCAG CGGCATCACG
ATCTGCGTCG AGCGGACGGC CTGGCAATCG GTCGGATCGT TCGACCCGTC GCTGCGCTAT
GCGCAGGACT ACGACATGTG GCTGCGGCTG CTGGCCAGGT TTCCGGCCCG CTTCATCGAC
CAATGGACCG TCACCAATCG CAACCATGCC TTGCAGGGCT CAGAAGTCTT TCCGCAGGCC
TGCTACTATG ACACCGCCAA AGCCGCGATC CAGTTCCTCA ATCAGCATCG TTTCGAGGCG
CTGTTCCCGC TCATGGATCT CACCGATCCG GCGCAGGCGG TTCGCGCGCT CGATCACGCC
CTGACGATCA CTTCGGATCC CTCGGCGTTT CTGTACGCCC TGGGCTGGCA TCCGGCGCTG
CTCTGGCGCG CCCTGGAATG GATCGCGGCC ACCGAACGCC GCGATCCGGT GCTGGGCCGG
CGGCTCCGAA CCAGAGCCCG CTGGATCTGC CGACGGAACG CCCGCGCCTG GAGCGACCGC
AATAGCGCGT CGATCTGGAC CTCGATCGGC GCCGCGCTCG CGATGGAGGG CCTCACCACC
GTCTACCGGA GCCTCGACCC CGTCGATATC GCGGTCGACC GCTATTTCAC ACTGCGAGCG
GCCGGGGATC CCGCGTCCGC CGCATTGGCG GCCTATCTGA AACAGTTCCA CGGGCTGTCG
CTGCCGGAAC CGGCGTCCAC CGGCGATCAG GGCGGTACGC TCGCTGTCGT CCTCGACCAC
GCAACGGGCG AAGCCGAGGC CCTGGCCCAG GCGCGGCCTG TGGTCCGGGC GATGGCCGCC
CGTGGGTGGC GGACCGTTCT GTTCACGGCG GGTGCACCAT CGTTTCAGCT CGACTTCACC
ACGCCGGTGA TCTCGGGTCC TCGGAACAAG ATTGTCGGGC TGGCCGCACA ATTCCAGCCC
CGATGCGTTC TGTCCGCCGA CCCGACCAAC GCGGCATTTC CCCATGGCTT CCCAGTGCTG
CATGTGTCAG CAGCCGCTGA CAGCGCGGTG CTCGCCGGTT TCGTGACGGA TTTGAGCTGC
GAAGCCTCGA CGCAGCCGCC ACGGCAGGAG GCCCCCCCCA CAAGCCGGAT TCCGTTGGTT
TTTCTCACGC GAGCGCTGCA TGGCGGCGGC GCCGAACGGG CGCTTCAGAG CGTCGCCAGT
GCGTTGAACA GTCAGCTGTT CGACGTCCAC ATCGTGCCGT TGTTCAACTC CGCGATCTCA
CCCGAATTCG GTCACGCGAC AGTAGCGCCT TCGTTCGAGG CGGTATGGTT CGAACGGACC
AAGACCGAGG TTGTTCAACA GCAGGTACCT TTGCCGCAGA CTCCTCCAAC TGCGGAGACC
ATGGCCGCTC CGCGGCCGCG CTTCCCGGCC TGGGTCACGT TCATTGCCAG CAAGCTGACG
TCGGCGCAAA AGCAGCGAAT TCGGGCGAGC ATTGCGTTCA AGATCGCCCG GCTGGGCTGG
CGCGCGTTCA AGGCGGCTCG CCGCGAACTG ACTGCTGACG CCCCGCCCCC CTGCGTCGAC
GTCGCTCTAC CCCCGCTGTC GGAGGAATTG TCGCCCGCTG CCGCGCGTCC CGGGATCAAG
GCAACGGCGC TCACCCCGTA CGACAACATG GCCGACTACA TCGCGCAGAT CCTAGCGGGG
CTAGGCCATA GCGCTATCGT GATTTCGCTT ATGGAAGAGG CGACGATCGT GGCGTGGCTC
GCCTCATTGC GCACGCCGAT GCGTTATCTG GCATGGCTCC ACACCGTCGA AAGTCTCTAC
CTCGACCAGA TGTTCCCCGC TCCCGCCCAA CGAGCCAAGT TCGACATCCT GTTGCATGCC
GCGGTGGCCC GTTCGGAGCG ATGCGTCTTC CCGTCGCGCG GCTGTTGCGA CGATCTGACC
GAACTGTACG AACTGCCGCC GGACCATTTC CAGTGCATCT ACAATCCGAT CGATCTCGCC
ACAGTACGGC GGCTGAGTGA GCTGCCATTT GAAAGGCCGC TCGCTCCACA CCCCAACGTC
CCGATCCTGG TCAGCCTCGG CCGGCTATCG CCTGAGAAGG ATCATGCACA TCTACTGAAG
GCACTGAGCC TCCTGCGGCA GCGGGGTCGG GATTTCCTTT GCCTGATCAT CGGCGATGGC
GACCATGTGG GCGAGATCAC ACGATTGATC GAGCACCATG CGCTCGCCGA CCAGGTCAGA
CTTTTAGGGG CCGTCCAGAA CCCATTTCCG TATCTGGCGG CCGCCGACGC GTTGATCCTG
ACGTCGAAAT TCGAGTCCTT CGCGCTCGTC CTTGTCGAGG CGATGGCCTT GGAGGCGGTG
CCGGTGGCGG TCGACTGCCC GACCGGCCCG CGCGAAGTGC TGGATTGCGG ACAGGCGGGC
GTTCTCGTCC CTCCCGGCGA CGAACGCGCG CTGGCGGATG CGATCGAACA CATCGTGTGG
TCAGAAGCCG ATCATTCCGC GCTGCTGGCC ACCATGGCCG ACAGGTTGAA ATCATTTGAC
ATCGCGACCG TGGCTTTGCA ATGGGAACGC CTTGTCGGGA AGGTCCATGC CAAGGCAACT
CTTACCGATG AAGCATGCCC TGATTCGACG CGCCCCAACT TCGAAAGCGC AGCCGATGTT
CAGGATAGTG CGAGATACTC CGCCGGATTG CACCGCTGA
 
Protein sequence
MPAPTFSIVV PTYNQAQYLG ACLDSIASQT DGDWEAIVVD DGSTDGTAAL ADDYAARDPR 
FRVIHQPNGG VASALNEGLR QARGQWIHWL SSDDLFDPRK LEINREQIRQ HPDCKFFFSF
FRLLRESTQE LTDHGLWGPL PDRESQIPTL FFRNYISGIT ICVERTAWQS VGSFDPSLRY
AQDYDMWLRL LARFPARFID QWTVTNRNHA LQGSEVFPQA CYYDTAKAAI QFLNQHRFEA
LFPLMDLTDP AQAVRALDHA LTITSDPSAF LYALGWHPAL LWRALEWIAA TERRDPVLGR
RLRTRARWIC RRNARAWSDR NSASIWTSIG AALAMEGLTT VYRSLDPVDI AVDRYFTLRA
AGDPASAALA AYLKQFHGLS LPEPASTGDQ GGTLAVVLDH ATGEAEALAQ ARPVVRAMAA
RGWRTVLFTA GAPSFQLDFT TPVISGPRNK IVGLAAQFQP RCVLSADPTN AAFPHGFPVL
HVSAAADSAV LAGFVTDLSC EASTQPPRQE APPTSRIPLV FLTRALHGGG AERALQSVAS
ALNSQLFDVH IVPLFNSAIS PEFGHATVAP SFEAVWFERT KTEVVQQQVP LPQTPPTAET
MAAPRPRFPA WVTFIASKLT SAQKQRIRAS IAFKIARLGW RAFKAARREL TADAPPPCVD
VALPPLSEEL SPAAARPGIK ATALTPYDNM ADYIAQILAG LGHSAIVISL MEEATIVAWL
ASLRTPMRYL AWLHTVESLY LDQMFPAPAQ RAKFDILLHA AVARSERCVF PSRGCCDDLT
ELYELPPDHF QCIYNPIDLA TVRRLSELPF ERPLAPHPNV PILVSLGRLS PEKDHAHLLK
ALSLLRQRGR DFLCLIIGDG DHVGEITRLI EHHALADQVR LLGAVQNPFP YLAAADALIL
TSKFESFALV LVEAMALEAV PVAVDCPTGP REVLDCGQAG VLVPPGDERA LADAIEHIVW
SEADHSALLA TMADRLKSFD IATVALQWER LVGKVHAKAT LTDEACPDST RPNFESAADV
QDSARYSAGL HR