Gene Rpal_5168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5168 
Symbol 
ID6412868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5571290 
End bp5573341 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content69% 
IMG OID642715058 
Productglycosyl transferase family 2 
Protein accessionYP_001994131 
Protein GI192293526 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCCTAA CCGACCGCGC CGGACTGCAC GAGACGATGA CCGACGGGGG CGGCGATGAT 
GAGGCAACTG CCCCATCGCC CGGGCCGCCC CGCTTTCTCG CCGCTTGGCC TCCCCCGGTC
AGCAACGACA ATCATCAGGG CGATGAGTGG TGCGAGGCTC GGGACAGCCA ATCCGAAGCG
ATCGCAGCGC CGGAGCTGGA TTGCCTACGC GGCGTGCTGG CGCCGACACT GCTGCACGCC
GCCGCCGAAC GGGCCAAAGA CCTCGACATC GGAGCAGACC GCGTCCTGAT CCAGCAGGGT
CTGATCGATG AGGACGCGTA TCTCCGCTAT CTCACCCGCT GGCTTCGCCT CGGTTTCGAA
GACTTCGGCG AGTTCAGCCG CACCGACTGT CCGCTGGAAG ACGCGCAGAT CCCCTCCGCC
ACCGCGACCG GAATCGTGCC GCTCCGGGTC GATGACGAAT TGGTCTGGGT GGTGGCGCCG
CGGCGCCTTG TGAGCCATCG GCTGTGCGGG CTGCTGGACG ACTATCCGGA CACACGACCG
CGGCTACGAT TAGCCTCGGC GGCGGCACTG GAGACCTTTC TGGCGCGGCA GGGCGACCGC
GCGCTGGCCA ACATCGCCAG CGCCGATCTG CACCAGCGCC ATCCGATGCT GTCGGCGGCA
CCCCGCAAGG CCGGACCGAT ATGGCGGCAA CGCCTCAAGC GCGGAGCCTG CGTGGCAGCC
CTGCTCGCCC TGCCGTTCTT CCCTGATCCC GACGTCACGA CGACGCTGCT GGCGGCCTGG
TTCATCGGCT TTGCCGGATT GCGGCTGCTG GCCTGCTTGT GGCCGCGCCC GACCTTGCCA
CCGTCGCCGC GAAAGCCGGA CGCAGAGCTA CCGACCTATA CGGTGGTGGC GGCGCTGTAT
CGCGAAGCCG ACTCGGTGGG GCCGCTGGTC GAAGCCCTCG AAGCGCTCGA CTACCCGCGC
GAGAAGCTCG ACCTGATCCT GGTGATCGAG CCCGACGACC TCGCCACCCG CGCGGCGCTG
GCCCGGATCA AGCGGCGACC GCATCTGCGG GTGCTGATCG CGCCGGCCGT CGAACCAAGA
ACCAAGCCGA AGGCGCTGAA CTACGCCCTC GCCTTCGCGC GCGGCAGTTT CATTGCGGTG
TACGACGCCG AAGATCGGCC CGATCCCGAC CAATTGCGGG CGGCGCTGGC GGCGTTCGAC
GCCACCGGCC CGGAGACGGC TTGCGTGCAG GCCAGCCTGT GCATCGACAA CCTCACCCAT
AGCTGGCTGT CTCGCACGTT CCTCGCCGAA TATGCCGGTC AGTTCGACCT GTTTCTGCCG
GGCCTTGCGG CACTCGGCCT GCCGCTGCCG CTCGGCGGCA CCTCGAACCA TTTCCGCACC
GAGGTGCTGC GGGGCATCGG CGGATGGGAT CCGCACAACG TCACCGAAGA TGCCGATCTC
GGCTTCCGGC TCGCCCGGTT CGGCCACCGC TGCGTCACCA TCCCCTCGAC CACCTACGAA
GAGGCGCCGA TCGCCTTTCG CAATTGGCTG CCGCAGCGGG CGCGCTGGAT GAAGGGCTGG
ATCCAGACCT GGGAAGTGCA TATGCGGCAC CCGATCCGGC TGTGGCGCGA GATCGGATGG
CGTGGCGTGG TCGGCCTCAA CCTGGTGGTC GGCGGCAACG TGCTGTCGGC CCTCGCCTAT
CCGCTGCTGG TGCTGCTCGC CGTGATGTCC GCGGCAGACT GGGCGGATCT TGTGCCCGTC
TGGCTCGACG GCTGGCTGGA GCCCGCGGCG CCGAGCGCGC TGCACTGGCT CACAATCGCA
TCCGGCGTTG CATCGACATT GGTGGTCGGC TTGCTCGGCC TCGCCCGGCG GCGGCAATTG
CGGCACGCCG GGGCGTTGGC GCTGACGCCG CTCTATTGGC TGTGCCTATC GGTGGCGGCG
TGGCGCGCGC TGGCGCAATT CGTCTGGTGT CCGTACCGCT GGGACAAGAC TCAGCACGGC
GTTGCCCGCC GTCCGCATCC GCTGGCGCCG AAGCCAAAAG CCCGGCGGTC GCGATGGCGG
CTGTCGGCGT AA
 
Protein sequence
MVLTDRAGLH ETMTDGGGDD EATAPSPGPP RFLAAWPPPV SNDNHQGDEW CEARDSQSEA 
IAAPELDCLR GVLAPTLLHA AAERAKDLDI GADRVLIQQG LIDEDAYLRY LTRWLRLGFE
DFGEFSRTDC PLEDAQIPSA TATGIVPLRV DDELVWVVAP RRLVSHRLCG LLDDYPDTRP
RLRLASAAAL ETFLARQGDR ALANIASADL HQRHPMLSAA PRKAGPIWRQ RLKRGACVAA
LLALPFFPDP DVTTTLLAAW FIGFAGLRLL ACLWPRPTLP PSPRKPDAEL PTYTVVAALY
READSVGPLV EALEALDYPR EKLDLILVIE PDDLATRAAL ARIKRRPHLR VLIAPAVEPR
TKPKALNYAL AFARGSFIAV YDAEDRPDPD QLRAALAAFD ATGPETACVQ ASLCIDNLTH
SWLSRTFLAE YAGQFDLFLP GLAALGLPLP LGGTSNHFRT EVLRGIGGWD PHNVTEDADL
GFRLARFGHR CVTIPSTTYE EAPIAFRNWL PQRARWMKGW IQTWEVHMRH PIRLWREIGW
RGVVGLNLVV GGNVLSALAY PLLVLLAVMS AADWADLVPV WLDGWLEPAA PSALHWLTIA
SGVASTLVVG LLGLARRRQL RHAGALALTP LYWLCLSVAA WRALAQFVWC PYRWDKTQHG
VARRPHPLAP KPKARRSRWR LSA