Gene Rpal_4470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4470 
Symbol 
ID6412154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4807017 
End bp4808342 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content65% 
IMG OID642714352 
Productglycosyl transferase group 1 
Protein accessionYP_001993441 
Protein GI192292836 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATCAGA CCGCCGACCG GCACGAACAG CCCTGGCTGT GGATGGACGT CTCGACCAGT 
GCGCGGGCGC GCTCCGGCCA GATGAATGGC ACCCTTCGGG TTGAACAAAG TTACATTCGC
GCCTTGTCGG CCGAGATGGC TCCAGGGCTG CGGTTCTGTC GCTATGACCA ACTGCGGCGC
GACTACGTCG CGGTCGCCAC ACCGCCCGAC CTGAGCGGCA AGCCGGTCGC CGGCAAGGCC
AAGTCGAAGC AGGCGAGCGG GATCGCCGCT GTCCTAAAGC CGATTGGCAA ATCCGTCGAA
CGCACCGTCA AGACCGCGGT CCGCGGCGCG ACCGCGTCGC TGCTGCGCAA GGCCAGCCAG
GCCGAGCCGC TGCCGAAACT CGGTGGGGAC GGCCTCAGCG AGGTGCTGTT TCTCGCTGGT
GAGAACTGGT CGCGGGTAGA CTTTGCCACC GTCGCCCGGA TGCGCCGCGA GCGCGGCACC
AAAGTGGCGG CGCTGTGCCA GGACTTCATC CCGGCCGTGG CGCCACAGTT CTTCGCCGGC
GGCGACTTCG TCACCAAGTT CGACGCCTAC GCGCAGTTCT TGATCAAGGA AACCGATCTG
GTCGTCTCTA TCTCGGAGGC GACTAAGCGC GATATTCTCG GCTACGCCCA GCGCCACGGT
GGGATGCACG GGGCTGTCGA AATCGTGCAT CTCGGTGCCG ATATTCCCGC ACCACAGGCG
GCGCGGCGGC CGGAAGCGCT GACCGATGCT CAGGCTAAGC GCTTCGTGAT CAGCGTGTCG
ACTATTCAGT CGCGCAAGAA TTTCGATCTC TTGTACCACC TCTGGCACCG GCTCACGGAG
CAGAACACGC TCGCCCTGCC GACGTTGGTG ATTGTTGGCC AGCCGGGGTT CGGAAGTAGT
GATCTCTTGT GGCAGATCGC CAATGATCCG GTGACGGCCA ACTCGATCCT GCATCTGCCG
CGCGCCGGCG ATGATGAGCT GGCGTGGCTG TATCAGCACT GCTTGTTCAC GCTGTATCCG
TCGTTCTATG AGGGGTGGGG GTTGCCGGTA TCCGAGAGCC TCGCCTTCGG CAAATACTGC
CTCGCCTCCG ATGCCTCGTC GCTGCCGGAA GCCGGCGCAG GCCTCGCGCG CCACCTCGAT
CCGCTCGATT TCCCCGCCTG GCGTGCTGCC GTCCTTGACC TGATCGCGGC GCCTGAGCAA
CTTGCTCGCC ACGAAGCCGC GATCCGCGCC GGTTATCGCC CAGTCACCTG GGCTCAATCA
GCAACGCGAC TCGCCGACGT GCTACGCGGC CTGGCCGCGA CGGGGGCCTC TGCACACCCC
AGATAG
 
Protein sequence
MDQTADRHEQ PWLWMDVSTS ARARSGQMNG TLRVEQSYIR ALSAEMAPGL RFCRYDQLRR 
DYVAVATPPD LSGKPVAGKA KSKQASGIAA VLKPIGKSVE RTVKTAVRGA TASLLRKASQ
AEPLPKLGGD GLSEVLFLAG ENWSRVDFAT VARMRRERGT KVAALCQDFI PAVAPQFFAG
GDFVTKFDAY AQFLIKETDL VVSISEATKR DILGYAQRHG GMHGAVEIVH LGADIPAPQA
ARRPEALTDA QAKRFVISVS TIQSRKNFDL LYHLWHRLTE QNTLALPTLV IVGQPGFGSS
DLLWQIANDP VTANSILHLP RAGDDELAWL YQHCLFTLYP SFYEGWGLPV SESLAFGKYC
LASDASSLPE AGAGLARHLD PLDFPAWRAA VLDLIAAPEQ LARHEAAIRA GYRPVTWAQS
ATRLADVLRG LAATGASAHP R