Gene Rpal_4164 details

Gene Information

Locus tag: Rpal_4164
Symbol:
ID: 6411848
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4460873
End bp: 4464175
Gene Length: 3303 bp
Protein Length: 1100 aa
Translation table: 11
GC content: 65%
IMG OID: 642714046
Product: trehalose synthase
Protein accession: YP_001993135
Protein GI: 192292530
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
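
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a stop codon that is not counted in the protein length. Both are assumptions of this example, not statements from the record; the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4460873
end_bp = 4464175
gene_length_bp = 3303
protein_length_aa = 1100


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, End bp minus Start bp plus one reproduces the 3303 bp gene length, and 3303 / 3 = 1101 codons minus the stop codon gives the listed 1100 aa.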


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAA TGACACCGAT CGCCCCGAGC GACAGGCCGG ACGCGGAAAC TTCTGACGAA 
CTCTGGTACA AAGACGCCAT CATCTACCAG TTGCACGTCA AGGCGTTCGC CGACAGCAAC
AATGACGGCA TCGGCGACTT CGCGGGTCTC ACCGAGAAGC TGGATTATCT GCAGGAGCTT
GGCGTCAATA CGCTGTGGCT GCTGCCGTTC TATCCCTCGC CGCAGCGCGA CGACGGATAC
GACATCGCCG ACTACGGATC GATCAATCCC GACTTCGGCA CGATGAAGGA CTTCCGCCGC
TTCATCGTCG AGGCCAAGAA GCGCAATCTG CGGGTCATCA CCGAGCTGGT CATCAACCAC
ACCTCGGATC AGCACGCCTG GTTCAAGCGG GCACGACGCA GCCCGAAGGA CTCCAGCGCG
CGCGACTGGT ACGTCTGGAG CGACACCGAC CAGAAGTATC TCGGCACCCG GATCATTTTC
ACCGATACCG AGAAATCGAA CTGGACCTGG GATCCCGAGG CCGGCCAGTA TTACTGGCAC
CGGTTCTTCT CGCACCAGCC GGATCTCAAC TTCGACAATC CGCGCGTCGT CAGCGCCGTG
GTCAAGGTGA TGAAACGCTG GCTCGACACC GGCGTCGACG GCTTCCGGCT CGACGCGATC
CCGTATCTGT GCGAGCGCGA CGGCACCAAC AACGAGAACC TTCCCGAGAC CCACGCCGTC
ATCAAGCGGC TGCGCGCCGA GCTCGACGCC TACGCCAAGG GCAAGCTGCT GCTCGCCGAG
GCCAATCAAT GGCCGGAGGA CGTGCAGCAG TATTTCGGCG ATGGCGACGA ATGCCACATG
GCCTACCACT TCCCGCTGAT GCCGCGGATC TATATGGCGA TCGCGCAGGA AGACCGTTTC
CCGATCACCG ACATCATGCG GCAGACGCCG GAGATCCCGG CGAACTGCCA GTGGGCGATG
TTCCTGCGCA ACCACGACGA GCTGACGCTC GAAATGGTCA CCGACGTCGA GCGCGACTAT
CTGTGGAAGA CCTACGCGGC CGATCCGCGA GCGCGGATCA ACGTCGGCAT CCGTCGCCGC
CTCGCGCCGC TGATGGACAA TGATCGCCGC AAGATCGAAC TAATGAACTC GCTGCTGATG
TCGTTCCCCG GCACGCCGAT CATCTACTAT GGCGACGAAA TAGGCATGGG CGACAACATC
TATCTCGGCG ACCGCAACGG CGTGCGGACA CCGATGCAGT GGTCGCCTGA CCGCAACGGT
GGATTTTCGC GGGCCGACCC GGCGCGGCTG TACGCGCCGC CGATCATGGA CCCGGTGTAC
GGCTACGCCT CGGTCAACGT CGAAGCCCAG CAGCGCAGCC TGTCGTCGTT GCTGAGCGCG
ATGAAGCGCT TGATCGCGGT GCGCAAATCG ACGCCGGCGT TCGGCCGCGG CAGCATCACC
TTCATCCGTC CGAGCAACCG CGCGGTGATC GCCTACGTTC GCCAATACAA GGACGAGGTC
ATCCTGTGCG TCGCCAATCT GTCGCGTGCC GCGCAGGCGA CCGAACTCGA CCTGTCGCCG
TGGAAGGACC GCGTGCCGCA GGAGATGCTC GGCCGCACCA AATTCCCGCC GATCGGCGAT
CCGGCCTACA TGATCACGCT GGCGCCCTTC GGCTTCTACT GGTTCAAGCT GCAGGAGCCC
GGCACGGCCG CGCATGTCGC GCCGGTCTCC ACCGTGCCTG AATTCGTCAC GCTGGTGGTG
CCGCTCGGTT CGACCTGGAT GACGCTCGGC CGCACCCGCT CGATGTTCGA GCACGAGGTG
CTACCGACCT TCCTGTCCCG CACCCGCTGG TTTCCGGAGC GCAATCCGCG GGCGATCCAT
CCGCGGCTCA CCTCGGCGAT TCCGTTCGCC AATGACGGCG ACAACCGTCC GTGGCTGGCG
TTCTTCGAGG CCACGCAGCG GGGGACGACG GCGCGCTATC TGCTGCCGAT GCAGATCGAC
TGGGTGCGGT TCGATCGCGA ACGCTACAAT CCGCGGGCCT ATGCGGCGGT GCGCCAGGGC
GCCCGCGAAG GCACCCTGCT CGATGTCGCC GCCGATCCCT CGTTCATCGA CCTGCTGCTG
GAGAACGTCC GCGACGCCGT CACCGTGACC GGCGATAACG GCGACCGACT GGAATTCCGC
CCTGGCTCCA AGCTCGCGGA ACGGCCCGCC GGACCGTTCC AGAACATTCG CGCGGTCGAA
ACCGAGCAGT CGAATTCGAC CGCACTGGTC GACGGCGACT ACGTCGTCAA ACTGTATCGG
CGGCTGCAGA TCGGCATCAA TCCCGAGCTG GAGATGGGAC GCTTCCTCAC CGAGGTCGCC
GGCTATGCCA ACACCCCGGC CCTGCTCGGC AGCGTCGAGC TGATCGAAGG CGACAAGACC
AGCGCGGTCG CGGTGGTGCA CGAATTCGCC CGCAATCAGG GCGACGGCTG GACCGTCACC
TCCGGCTATC TCGATCGCTA CATCGACGAA CAACGCGTGC TGAGCCGCGC CGAGGAAAAG
ACCGAGAGCG ATCAGCTCGC GCCCTACCTG CATTTCATGC AGCAGACCGG CAAGCGGGTC
GCGGAGATGC ACATCGCGCT CGCCAGCCAT CCCGAGGTGC CGGACTTCGC CCCGGAGCCG
ATCACCCTCG ACGCCTCGCG CGACTGGGCC GAAACCGTTG CGGCATCCGC CGACCGGATG
CTCGACGAAC TGCGGCGCCG GCGCGACACG CTGAAGGAAG GCGACCGGAC GCTGGTCGAC
GAACTGCTGG CGGCCCGCGA CGTGCTGATG TCCCGCATCC ACGGCCTGCT CGGCGACGAC
GGCGGCCTGA ACATCCGCCA TCACGGCGAC TTCCATCTCG GCCAGATGTT GATCGTCAAG
GACGACATCT ACATCATCGA CTTCGAAGGT GAGCCGCGCC GCACCTTGGA CGAACGCCGC
GCCAAAGCAC CGGCGGCTCG CGACGTGGCC GGGCTGATCC GGTCGATCGA CTATTCGACG
ACCGCCGCTC TCGAGCGAGC GCTGAAAGCA GCCTCCGACG AGCCCGGCCG GCTGACCGAG
GCGCTCGACC TGTGGCGGAT CCGCGCCACA GGCGCGTTCC TGGACGCCTA TCGCCAAACC
ATGGGCGACA GCCCGGTGTG GCCGGCGGAT ATCGCGGCCG CCGACCGGGT GCTCGACTTC
TTCCTGATTG AAAAGGCGCT GTACGAGATC GACTACGAGA TCGCGCATCG TCCCGACTGG
GTGCATGTGC CGCTGGCCGG CATTCTCCGT ATCCTGTCGC CGCCTCCCGA GGAGCTTCCA
TGA
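
The gene sequence above is laid out in space-separated groups of ten bases, so it has to be joined before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(block):
    """Remove spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above, used as sample input.
first_line = "ATGAATAAAA TGACACCGAT CGCCCCGAGC GACAGGCCGG ACGCGGAAAC TTCTGACGAA"
seq = clean_sequence(first_line)

print(len(seq))                        # number of bases in this one line
print(round(100 * gc_fraction(seq)))   # GC percentage of this one line only
```

Passing the full sequence block through the same two functions should give values comparable to the listed gene length and GC content, assuming the GC content field is computed over the gene itself.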
 
Protein sequence
MNKMTPIAPS DRPDAETSDE LWYKDAIIYQ LHVKAFADSN NDGIGDFAGL TEKLDYLQEL 
GVNTLWLLPF YPSPQRDDGY DIADYGSINP DFGTMKDFRR FIVEAKKRNL RVITELVINH
TSDQHAWFKR ARRSPKDSSA RDWYVWSDTD QKYLGTRIIF TDTEKSNWTW DPEAGQYYWH
RFFSHQPDLN FDNPRVVSAV VKVMKRWLDT GVDGFRLDAI PYLCERDGTN NENLPETHAV
IKRLRAELDA YAKGKLLLAE ANQWPEDVQQ YFGDGDECHM AYHFPLMPRI YMAIAQEDRF
PITDIMRQTP EIPANCQWAM FLRNHDELTL EMVTDVERDY LWKTYAADPR ARINVGIRRR
LAPLMDNDRR KIELMNSLLM SFPGTPIIYY GDEIGMGDNI YLGDRNGVRT PMQWSPDRNG
GFSRADPARL YAPPIMDPVY GYASVNVEAQ QRSLSSLLSA MKRLIAVRKS TPAFGRGSIT
FIRPSNRAVI AYVRQYKDEV ILCVANLSRA AQATELDLSP WKDRVPQEML GRTKFPPIGD
PAYMITLAPF GFYWFKLQEP GTAAHVAPVS TVPEFVTLVV PLGSTWMTLG RTRSMFEHEV
LPTFLSRTRW FPERNPRAIH PRLTSAIPFA NDGDNRPWLA FFEATQRGTT ARYLLPMQID
WVRFDRERYN PRAYAAVRQG AREGTLLDVA ADPSFIDLLL ENVRDAVTVT GDNGDRLEFR
PGSKLAERPA GPFQNIRAVE TEQSNSTALV DGDYVVKLYR RLQIGINPEL EMGRFLTEVA
GYANTPALLG SVELIEGDKT SAVAVVHEFA RNQGDGWTVT SGYLDRYIDE QRVLSRAEEK
TESDQLAPYL HFMQQTGKRV AEMHIALASH PEVPDFAPEP ITLDASRDWA ETVAASADRM
LDELRRRRDT LKEGDRTLVD ELLAARDVLM SRIHGLLGDD GGLNIRHHGD FHLGQMLIVK
DDIYIIDFEG EPRRTLDERR AKAPAARDVA GLIRSIDYST TAALERALKA ASDEPGRLTE
ALDLWRIRAT GAFLDAYRQT MGDSPVWPAD IAAADRVLDF FLIEKALYEI DYEIAHRPDW
VHVPLAGILR ILSPPPEELP
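
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. As a rough illustration, the sketch below builds the codon table by hand and translates two short excerpts of the gene sequence. Table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so this sketch does not handle alternative start codons (this gene begins with ATG, so that case does not arise). `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAATAAAA TGACACCGAT CGCCCCGAGC"))  # MNKMTPIAPS
# Last 9 coding bases plus the terminal TGA stop codon.
print(translate("GAGCTTCCA TGA"))                     # ELP
```

The two outputs match the first ten and last three residues of the protein sequence listed above.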