Gene RPD_4109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4109 
Symbol 
ID4024631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4573977 
End bp4575323 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content69% 
IMG OID637964317 
Productthree-deoxy-D-manno-octulosonic-acid transferase-like 
Protein accessionYP_571229 
Protein GI91978570 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1519] 3-deoxy-D-manno-octulosonic-acid transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGCG CGTCCACTCC GCGGTGGCAG ACGGGGCCTT CGTTGGCCGA ACCGTTGCCG 
ATGACGCTGC GCGTGTATCA AAAACTCACG GCCGGGTTTG CGCCGCTGGC CACGCTGCTG
ATCAAGCGTC GGCTCAAGCA GGGCAAGGAA GAGGCGGCGC GCGTCGACGA GCGCCGCGGC
GTCGCCGCGC ATGTGCGGCC GCACGGGCCG CTGGTGTGGA TCCACGGCGC CAGCGTCGGC
GAAGTGCTGG CCGCGGCCGG TCTGATCGAG CGGCTGCGCG CGCTCAACCT GCGCATTCTA
CTAACCTCCG GCACCGTGAC CTCTGCGTCG GTGGTGGAAA AGCGGTTTCC GCCGGATATC
ATCCATCAGT TCATTCCCTA CGACGCGCCG CGTTTCGTGG CGCGCTTTCT CGATCACTGG
CAGCCGTCGC TGGCGCTGTT CATCGAATCC GATCTGTGGC CGAACCTGAT CCTCGCGTCC
GCCGCGCGGC GGCTGCCGAT GGTGCTGATC AACGGCCGGA TGTCGCAGCG TTCGTTCCCG
CGCTGGCGGC GCGCCGCGGC GACGATCGGC ACGCTGCTCG GCAAGTTCGA CATCTGCCTC
GCGCAATCGC GGATGGATGC CGAGCGGTTT TCGGCGCTCG GCAGCCGCAA CGTCATCACC
ACCGGCAATC TCAAGATGGA TGTCGATCCG CCGCCGGCCG ATCCGGCGCG GCTGGAGCGG
CTGATGGCGG TGACGCGCGG CCGGCCGGTG ATCGTCGCCG CCTCGACCCA TCCGGGCGAG
GAGGAAATTT TGCTCGACGT CCACCGCACG CTCACCGGCG TGTTCCCGAC GCTGCTCACC
GTGATCGTGC CGCGGCATCC GCATCGCGGC GAACAGATCG GCGGGCTGGT CGAGGCCGTC
GGCTTGCAGA CCGCGCTGCG CTCGCGCGAG CAGCTTCCGA CCGCGGCGAC CGCGGTCTAT
GTCGCGGACA CCATGGGCGA ACTCGGCCTG TTCTATCGGC TGGCGCCGAT CGTGTTCATG
GGCGGCTCGC TGATCGAGCA TGGCGGCCAG AATCCGATCG AGGCGGTCAA GCTCGGCGCG
TCGATCGTGC ACGGCCCGCA CGTCTCGAAT TTCAGCGATG TCTATCGTGC GCTCGACGAC
GAAGGCGGCG CGTTCAGCGC GGGCGACGTC GACGCGCTGG TGCGCCGGTT CGGGCAGCTG
CTGTCCGACG ACCACGCGCG CCAGACCTCG ATCGATGCCG CCACGGCCGT GGTCGAGCGG
CTCGGCGGTG CGCTCGACCG CACGCTCTCC GCGCTCGAGC CCTATCTGCT GCAATTGCAG
ATCGAGCAGG GCGCCGCCGA TGCGTGA
 
Protein sequence
MTGASTPRWQ TGPSLAEPLP MTLRVYQKLT AGFAPLATLL IKRRLKQGKE EAARVDERRG 
VAAHVRPHGP LVWIHGASVG EVLAAAGLIE RLRALNLRIL LTSGTVTSAS VVEKRFPPDI
IHQFIPYDAP RFVARFLDHW QPSLALFIES DLWPNLILAS AARRLPMVLI NGRMSQRSFP
RWRRAAATIG TLLGKFDICL AQSRMDAERF SALGSRNVIT TGNLKMDVDP PPADPARLER
LMAVTRGRPV IVAASTHPGE EEILLDVHRT LTGVFPTLLT VIVPRHPHRG EQIGGLVEAV
GLQTALRSRE QLPTAATAVY VADTMGELGL FYRLAPIVFM GGSLIEHGGQ NPIEAVKLGA
SIVHGPHVSN FSDVYRALDD EGGAFSAGDV DALVRRFGQL LSDDHARQTS IDAATAVVER
LGGALDRTLS ALEPYLLQLQ IEQGAADA