Gene RPB_1855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1855 
Symbol 
ID3908050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2120158 
End bp2121504 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content70% 
IMG OID637883749 
Productthree-deoxy-D-manno-octulosonic-acid transferase-like 
Protein accessionYP_485474 
Protein GI86748978 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1519] 3-deoxy-D-manno-octulosonic-acid transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.123633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000681273 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACGAGCG CGTCACCTCC GCGGTGGCAG ACGGGGCCTT TGCTGGCCGA ACCGTTGCCG 
ATGACGCTGC GCGTCTATCA GCAGCTCACC GCTGGTATCT CCCCGCTCGC CCCGCTGCTG
ATCCAGCGGC GGCTCAAGCA GGGCAAGGAA GAGCCGGCGC GGGTCGACGA GCGCCGCGGT
GTCGCCGCGC ATGTGCGGCC ACACGGACCG CTGGTGTGGA TCCACGGCGC CAGCGTCGGC
GAGGTGCTGG CCGCGGCCGG ACTGATCGAG CGGCTGCGCG CGCTCAACCT GCGCATCCTG
CTGACGTCCG GCACCGTGAC CTCGGCAGCC GTGGTGGCGA AACGGTTTCC GCCCGACATC
ATCCATCAGT TCATCCCCTA TGACGCGCCG CGCTTCGTCG CGCGCTTCCT CGATCACTGG
CAGCCGTCAT TGGCGCTGTT CGTCGAATCC GACCTGTGGC CGAATCTGAT CCTCGCCTCC
GCGGCGCGGC GGTTGCCGAT GGTGCTGATC AACGGCCGGA TGTCGCAGCG CTCGTTCCCG
CGCTGGCGGC GCGCCGCAGC GACGATCGGC ACGCTGCTCG GCAAGTTCGA CATCTGCCTC
GCGCAATCGC GGATGGACGC CGAGCGGTTC GCGGCGCTGG GCAGCCGCAA CGTCATCACC
ACCGGCAATC TCAAGATGGA CGTCGACCCC CCGCCGGGCG ATCCGGCGCG GCTGGAACGG
CTGATGGCGG TGACGCGCGG CCGGCAGGTG ATCGTCGCCG CCTCGACCCA TCCGGGCGAA
GAGGAGATCC TGCTCGACGT CCACCGCAGG CTTGCCGGGG CGTTCCCGGC GCTGCTCACC
GTGATCGTGC CGCGGCATCC GCATCGCGGC GAGCAGATCG CTGGCCTGAT TGAGGCCTCC
GGCCTGCACG CGGCGTTGCG CTCGCGCGAG CAACTGCCGA CCGCGGCGAC GGCGATCTAC
GTCGCCGACA CCATGGGCGA GCTCGGCCTA TTCTACCGGC TGGCGCCGAT CGTGTTCATG
GGCGGCTCGC TGATCGAGCA CGGCGGCCAG AATCCGATCG AGGCGGTCAA GCTCGGCGCC
TCGATCGTGC ACGGACCGCA CGTCTCCAAT TTCACCGACG TCTATCGCGC GCTCGACGAC
GAGGGCGGTG CCTTCACGGC GGCCGACGCC GACGCGCTGG TGCGGCGGTT CGGCCAGTTG
CTGTCCGACA GCAATGCGCG CCAGACCTCG ATCGATGCCG CCACCCGCGT GGTCGATCGG
CTCGGCGGCG CGCTGGACCG CACCGTCGCG GCGCTGGAAC CCTATCTGCT GCAATTGCGC
ATCGAGCAGG GCGCCGCCGG TGCGTGA
 
Protein sequence
MTSASPPRWQ TGPLLAEPLP MTLRVYQQLT AGISPLAPLL IQRRLKQGKE EPARVDERRG 
VAAHVRPHGP LVWIHGASVG EVLAAAGLIE RLRALNLRIL LTSGTVTSAA VVAKRFPPDI
IHQFIPYDAP RFVARFLDHW QPSLALFVES DLWPNLILAS AARRLPMVLI NGRMSQRSFP
RWRRAAATIG TLLGKFDICL AQSRMDAERF AALGSRNVIT TGNLKMDVDP PPGDPARLER
LMAVTRGRQV IVAASTHPGE EEILLDVHRR LAGAFPALLT VIVPRHPHRG EQIAGLIEAS
GLHAALRSRE QLPTAATAIY VADTMGELGL FYRLAPIVFM GGSLIEHGGQ NPIEAVKLGA
SIVHGPHVSN FTDVYRALDD EGGAFTAADA DALVRRFGQL LSDSNARQTS IDAATRVVDR
LGGALDRTVA ALEPYLLQLR IEQGAAGA