Gene Rpal_4587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4587 
Symbol 
ID6412271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4944435 
End bp4945724 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content63% 
IMG OID642714467 
Productglycosyl transferase group 1 
Protein accessionYP_001993556 
Protein GI192292951 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCGA CGTCGACACA GGATCCGACA TTCGAGACCG ACGATTACCT CGGCGGACCG 
CGCGTGATGT TCGTGGGGTG GGCGGAGAGT TCGCACACCC ATTCCTGGAT CGATCTGCTG
CAGCAATCCG GCATCAACGC GCGACTGTTC GCCCTGCCCT CCGCCGTCCC ACCCGACAGC
TGGCAAGTGA GAACCTATGT GACGATCGCG GCGCCCAGCC CGATCGATTC TGCCTACCGC
AAGACGCTGA CGTCTGCGCC GACCGCGCCG CCGCCCAGGC GGAGTTTGTT TAGCTGGATC
AGGCCGGCTC CAACAGCACC CGCGGTGACC ATCGAGCGAC TGCTGGCGGA CGCCATCATG
CAGTGGAAGC CCGACGTCGT GCACACGCTC GGCCTTGATG CCGCGACCTA TCTGATGGAG
CGGACGCGCC AGCTGCATCC GGAAATCGTG GGAATCGGCC GTTGGGTGGT GCAGGTTCGC
GGCGGGCCGG ACCTAGCACT GCATCAATAC GACCCGGTTC ATCGTCCCAA GATCGAGAGC
ATCTTCGCGC AGTGCGATCA CCTGATCGCC GACAATCCGA TCAACTACGA AGACGCCGTC
AAGCTCGGGC TCGCCGCGGA GAAGACCTGC TCGCCGGGAC TGGGCGTCGT CTCCGGCCCG
GGCGGCATCG ACGTGCAGGG CCTGCGCAGC CGGTGGTCGC TCTTGCCTTC ACAACGGCCG
CGGACCATAT TGTGGCCGAA GACCTATGAA ACCATTTCCT CGAAGGCCCT CCCGGTTTTA
GAGGCTATTC GGCTTGCGTG GGACCGGATT GCTCCGGTGA CATTCGAAAT GCTCTGGCTG
GTCCAGAGCG ACGTCCGGAT CTGGTTCGAG AAGAGCATGC CGGACCACAT CAAAGCGTCC
TGCAACCTGT ACGGCCGCCT GGATCGCGAA CAGGTACTCG CCATGCTGCC GTCGGCGCGC
GTGATGCTGG CTCCGTCGCT GACGGACGGA ATTCCAAATT CGATGATGGA AGCCATGGCG
CTGGGAGCGT TTCCAATCGT ATCGCCGCTC GACACCATCA CCCCGGTCGT CAAGAACGAG
GAGAATGTTC TGTTCGCTCG CAATCTGTAT CCCGAGGAAA TCGCCGACGC GCTGGTCCGT
GCGATGCAGG ACGACGCGCT GGTCGATCGG GCCGCGGCCA ACAACGTGAT CAGAGTCGAC
GAACTGGCCA ACCGCGACCG CGTCAGGATT GCAGTCATCG ATTACTACAA GCAACTCACC
GCCCTGCAGC GCCAGACGAG GGCGGCATGA
 
Protein sequence
MSATSTQDPT FETDDYLGGP RVMFVGWAES SHTHSWIDLL QQSGINARLF ALPSAVPPDS 
WQVRTYVTIA APSPIDSAYR KTLTSAPTAP PPRRSLFSWI RPAPTAPAVT IERLLADAIM
QWKPDVVHTL GLDAATYLME RTRQLHPEIV GIGRWVVQVR GGPDLALHQY DPVHRPKIES
IFAQCDHLIA DNPINYEDAV KLGLAAEKTC SPGLGVVSGP GGIDVQGLRS RWSLLPSQRP
RTILWPKTYE TISSKALPVL EAIRLAWDRI APVTFEMLWL VQSDVRIWFE KSMPDHIKAS
CNLYGRLDRE QVLAMLPSAR VMLAPSLTDG IPNSMMEAMA LGAFPIVSPL DTITPVVKNE
ENVLFARNLY PEEIADALVR AMQDDALVDR AAANNVIRVD ELANRDRVRI AVIDYYKQLT
ALQRQTRAA