Gene Rpal_3220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3220 
SymbolmdoD 
ID6410890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3466007 
End bp3467518 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content66% 
IMG OID642713096 
Productglucan biosynthesis protein D 
Protein accessionYP_001992197 
Protein GI192291592 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.321251 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGATTGA ACCGGCGGCA AGTTCTGACC GGGCTGGCGG CGTTGCCGCT GTTGCAGGCA 
AAGCCCGATC CCGCCGCGGC GGACCGCTCT TCCTCGATCG ACTTCGATCC TTGGATGGTG
CGCAAGCTGG CGCGCGAGCT GGCGAGCAAA CCTTATGAGG CGCCCGACAG CTCGTTGCCG
GCCTCGCTGA ACGATCTGAG CTACGACGCC TACCGGTCGC TGCGCTTTCG ACCCGAGCGC
GCCCTCTGGC GTGCCGAGAA CCTGCCGTTT CAGGTCCAGT TCTTCCACCG CGGCTTCCTC
TACAAGAACC GGGTGACGAT CTTCGAAGTC GCCGATGGCA AGGCGCGCCA CGTGCCGTAT
CGCGCGGACG ATTTCTCGTT CGGCGACGTC GCGCCGCCGC CGGATGCCGA TCTCGGCTTT
GCGGGGTTTC GGATTCACGC GCCGCTGCAG CGTGCGGACT ACTACGACGA GGTCAGCGCC
TTCCTCGGAG CGGCCTACTT TCGCGCCGTC ACCAAGGGCG AGCGTTACGG CCTCTCTGCG
CGCGGCCTGT CGATCGACAC CGGCCAGTCG AGCGGCGAGG AATTCCCGCT GTTCAAGACA
TTCTGGCTCG AGCGCCCTAC TCCGGGCGCA TCGTCGATGG TGGTGCATGC GCTGCTCGAC
AGCAAGAGCG TCGCCGGCGC CTACCGCTTC ACCATCCGCC CCGGCGACAC CACGGTGTTC
GACGTCGAGA TGGCGCTTTA TCCGCGCGTC GATCTGCAGC ATGCCGGACT GGCGCCGATG
ACCAGCATGT TTCTGTTCGG CCCGAACGAT CCGGCCGATA CCCCGGACTT CCGTGCCGCG
GTGCACGATT CCGACGGACT CGCGATCTTC AACGGCAGCG GCGAAGAGCT GTGGCGGCCT
CTGTGCAATC CGAGGGATCT GCAGATCAGC TCGTTCGGTG ACCGCAACCC GCGCGGCTTC
GGCCTGATGC AGCGCGAACG CAGCTTCGCT AACTATCAGG ATCTCGAATC CCGGTACGAG
CTGCGGCCGA GCCTGTGGGC CGAGCCGATC GGCGACTGGA CCGATGGCGC CGTCAAGCTG
ATCGAGATTC CCACCCGCGA AGAGGTGCAC GACAATATCG CGTCGTTCTG GGAGCCGAAG
CAGCCGCTAC GCGCCAAGGG CGAACACATC TACACCTATC GGCTGCACTG GGGACCGGAT
GCGCCGAAGC CCAAGGGGCT CGCGCGGTTC GTGCGGACCG GCATCAGCGC GCGCGGCGAC
AATGAACGGT TGTTCGTGCT CGATCTTGCC GGTGACCGGC TGAAGACCGT CGATGCCGCC
GCGGTCCGCG GCGTGGTCAC CGCCGACAAG GGCGAGATCC GCAATATCGT AACCCAGCCG
AACCCGGCGA TGGGCGGATG GCGGCTCAGC TTCGACCTCG CGCAGGCGCG AGCCCCGGTC
GAATTACGGG CGGTCGTGTG CGAGGGCGAC GCGGCGGTCT CGGAAGTTTG GCTGTACCGA
TGGACGCCGT GA
 
Protein sequence
MRLNRRQVLT GLAALPLLQA KPDPAAADRS SSIDFDPWMV RKLARELASK PYEAPDSSLP 
ASLNDLSYDA YRSLRFRPER ALWRAENLPF QVQFFHRGFL YKNRVTIFEV ADGKARHVPY
RADDFSFGDV APPPDADLGF AGFRIHAPLQ RADYYDEVSA FLGAAYFRAV TKGERYGLSA
RGLSIDTGQS SGEEFPLFKT FWLERPTPGA SSMVVHALLD SKSVAGAYRF TIRPGDTTVF
DVEMALYPRV DLQHAGLAPM TSMFLFGPND PADTPDFRAA VHDSDGLAIF NGSGEELWRP
LCNPRDLQIS SFGDRNPRGF GLMQRERSFA NYQDLESRYE LRPSLWAEPI GDWTDGAVKL
IEIPTREEVH DNIASFWEPK QPLRAKGEHI YTYRLHWGPD APKPKGLARF VRTGISARGD
NERLFVLDLA GDRLKTVDAA AVRGVVTADK GEIRNIVTQP NPAMGGWRLS FDLAQARAPV
ELRAVVCEGD AAVSEVWLYR WTP