Gene Rpal_3765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3765 
Symbol 
ID6411443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4042276 
End bp4043397 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content60% 
IMG OID642713646 
Productglycosyl transferase group 1 
Protein accessionYP_001992739 
Protein GI192292134 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGGTC AAGCAGGACT CGTCGATTGC GACTTCACGC TCGCGCTCAA CAACAGGACC 
GGCAAGTTCT TCTTCTGCGG CGATCTGATC GCCGGCTCGA AGGATCTGAT CCGCGACGTC
TATTACTGGA GGCTTGGATT CGACCAGATT CCCACCGGTC TGATTGCCCG AATCCTGGGC
CGACTGGCCG TCGTAGAAAT CGATCTCCGG GTTCGGCATC CCAGGACACT CCCGACCTTC
TTGTCGCGCC GGGACAGGCT ACCAATCGTC TTTACCGATC CAAGAGAAGT CCTGATCCAC
GACCTGCGCG AAACCGATAT CATCCTTTGC CATGACGTCG GCCCGCTGAC CCACCCGACC
TTTTACGCCG ATGGCGTCGA GCAGATCTAC CGCGCGGCGT TCGATCGCAT TGCGGAGGCC
AAGCCGCATC TTCTGTTCGC CAGTGAAAGC TCCTGCGAGG AGTTCAAGCT GCTCTATGGC
GACGATTTCC CGTACTTGGG CGTTCTTTAC CCCCCGATCA GATTCGGCGC GGGCTCCTCA
GACCAGCAAC CGGTGACGTC TATCCCGGGC AAGTTCTTTC TGTCCGTCGG TAGCCTGGGA
ACGCGCAAGA ACCAGTTGCG GGCGATCGAG GCCTTCGGAC GAAGCGGCCT CGTCGAGAAG
GGATATCGAT ATGTGATTTG CGGCGGGCCG GAGCCTGGCG CCGAACACGT CATCGCCGCT
GCAGACCAGA CCCCCGGCGT CCTTATTCCG GGTTACGTCA ACGATCCGCA GCTTCGCTGG
CTGTACTCCC ACGCAGAAGG ATTCATTCTT CCGAGTTTGC TTGAAGGCTT CGGCTTGCCG
GCGGCCGAGG CGATTCACTA TGGGGTGATG CCATTGCTGA GCCGAGGCGG CGCTCTCGAA
GAGGTCGCAG GCCCATCGGC CATTCTTGTC GACCCGCTGG ATGTCGATGC GATCGTCCAA
GGAATGCATC AGATTGCAGT CATGAGCGAG GGGGAGAAGG CGCAACGCTT GGATCAGATG
CGAACGAGTA TTGCGAGATT TTCGACGGAA AACGCCTTAG GGGTCTGGCG ATCAGTTCTG
TCCCGCGCCG CTTCGCTTCA CCAGCATGTG GGCGCTAGCT GA
 
Protein sequence
MSGQAGLVDC DFTLALNNRT GKFFFCGDLI AGSKDLIRDV YYWRLGFDQI PTGLIARILG 
RLAVVEIDLR VRHPRTLPTF LSRRDRLPIV FTDPREVLIH DLRETDIILC HDVGPLTHPT
FYADGVEQIY RAAFDRIAEA KPHLLFASES SCEEFKLLYG DDFPYLGVLY PPIRFGAGSS
DQQPVTSIPG KFFLSVGSLG TRKNQLRAIE AFGRSGLVEK GYRYVICGGP EPGAEHVIAA
ADQTPGVLIP GYVNDPQLRW LYSHAEGFIL PSLLEGFGLP AAEAIHYGVM PLLSRGGALE
EVAGPSAILV DPLDVDAIVQ GMHQIAVMSE GEKAQRLDQM RTSIARFSTE NALGVWRSVL
SRAASLHQHV GAS