Gene Rpal_4167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4167 
Symbol 
ID6411851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4468638 
End bp4470404 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content66% 
IMG OID642714049 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_001993138 
Protein GI192292533 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.423754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTCC GCAGTTTCGG GCCGCGTCTC ACCGGTAACG GCGTCGAATT TCGCATCTGG 
GCGCCGAACG CGAAACGTGT CGATGTCGTG CTCGACCGTC CGCATCCGAT GCGGCGCGAC
GCCAAAGGTT GGTACACCAC CCACATCGAC GGCATCGGCG CCGGCGCGCG CTATCGCTTC
CTGATCGACG GCAGCCTCGA TGTACCGGAC CCGGCCTCGC TGTTTCAGCC CGAGGATCTG
GCCGGGCCGA GCGAAGTGAT CGACCACGCC GCATATCAAT GGCGCGCCGA GGATTGGCGA
GGGCGGCCCT GGGCCGAAAC CGTGCTGCTC GAAACCCATG TCGGCACCTT CACGCCGGAG
GGCAGCTTCC GCGCGATGAT CGACAAGCTC GACCATCTGG TCGACACCGG CATCACCGCG
CTGGAATTGA TGCCGCTCGC CGATTTCCCC GGCACGCGCA ATTGGGGCTA CGACGGCGTG
CTGTGGTATG CGCCGGACAG CGCCTATGGC CGTCCCGAGG ATCTCAAGGC GCTGATCGAC
GAGGCGCATC TGCGCGGCCT GATGGTGTTC CTCGACGTCG TCTATAACCA TTTCGGCCCC
GAAGGGAATT ACCTCGGCCA ATACGCACCC GGGTTCTTCT CCGATTCGCA CACGCCGTGG
GGCAACGGCA TCAACTACTA CGTCGAGCAG GTCCGCGCCT TCGCGATCGA GAACGCGTTG
TACTGGTTGG GCGATTATCG TTTCGACGGT CTGCGGCTCG ACGCGGTGCA TGCGATCCCC
GATCAGGGCG AAATCCCGAT GCTGGACGAG CTGTCGCGCG AAGTCGGCCG GCTCGCCGCG
GAGACCGGGC GCTACGTGCA TCTGGTGCTC GAGAACGACG ACAACACCGC CGCCGTGCTC
GATCCGGTCA CCCATCCGCC GCGGGGGCAA TATCGCGCGC AATGGAACGA CGACTACCAC
CACGCCTGGC ACGTCGCGCT GACCGGCGAG ACTCACGGCT ACTATTCAGA TTACGCCAAT
GCGCCGCTGG CGCATCTTGC CCGCGCGCTC GGCTCCGGCT TCGTATTTCA GGGCGAGCCT
TCGGAGCATC GCGGCGGCCA GCCGCGCGGC GAACCGAGCG GAGACCTCGT GCCGCTCGCC
TTCATCAACT TCCTGCAGAA CCACGATCAG ATCGGCAACC GCGCGCTCGG AGACCGGTTG
GAAAGCCTGG TCAAGCCGCA GGCGATCGAA GCGGCGCTCG CCGTCACGCT GCTGGCGCCG
ACGGTGCCGA TGCTGTTCAT GGGCGAGGAA TGGGGATCGC AGGCGCCGTT CCCGTTCTTC
TGCGATTTTC AAGGTGATCT TGCCGAAGCG GTGCGGGCCG GACGGCGCAA GGAATTCGCC
GGCGCCTACA AAGAGTACGG CAACGAGGTC CCCGATCCGC TCAGCAAGGA CGCGTTCGAC
AGCGCCGTGC TCGATTGGCA CGAGCGCGAC CAGGGCCGCG GCGCCGCGCG TCTGGCGCTG
GTCAAGCGCC TGCTCGCCAT CCGCCACCGC GAGATCGTGC CGCGGCTTGA CAGCTCGCGG
TTCGGCGACG CTCGGATCAC CGGCGACGGC CTGCTGAAGG CATTCTGGCG GCTCACCGAC
GGTGCGACGC TCGAACTCGT CGCCAATCTG TCGGACGACG AACGTGACGG CGGGGCGCCA
CCCGCGGCGG GAACGATTCT ATGGGGCGGC GATTGGAACC GGATCATTCC GCCTTGGGCG
GTGTCCTGGC GCCTCGAGCA TCCGTAA
 
Protein sequence
MTLRSFGPRL TGNGVEFRIW APNAKRVDVV LDRPHPMRRD AKGWYTTHID GIGAGARYRF 
LIDGSLDVPD PASLFQPEDL AGPSEVIDHA AYQWRAEDWR GRPWAETVLL ETHVGTFTPE
GSFRAMIDKL DHLVDTGITA LELMPLADFP GTRNWGYDGV LWYAPDSAYG RPEDLKALID
EAHLRGLMVF LDVVYNHFGP EGNYLGQYAP GFFSDSHTPW GNGINYYVEQ VRAFAIENAL
YWLGDYRFDG LRLDAVHAIP DQGEIPMLDE LSREVGRLAA ETGRYVHLVL ENDDNTAAVL
DPVTHPPRGQ YRAQWNDDYH HAWHVALTGE THGYYSDYAN APLAHLARAL GSGFVFQGEP
SEHRGGQPRG EPSGDLVPLA FINFLQNHDQ IGNRALGDRL ESLVKPQAIE AALAVTLLAP
TVPMLFMGEE WGSQAPFPFF CDFQGDLAEA VRAGRRKEFA GAYKEYGNEV PDPLSKDAFD
SAVLDWHERD QGRGAARLAL VKRLLAIRHR EIVPRLDSSR FGDARITGDG LLKAFWRLTD
GATLELVANL SDDERDGGAP PAAGTILWGG DWNRIIPPWA VSWRLEHP