Gene RPB_1601 details


Gene Information

Locus tag: RPB_1601
Symbol:
ID: 3910072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1804001
End bp: 1805077
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 69%
IMG OID: 637883497
Product: lipopolysaccharide heptosyltransferase II
Protein accession: YP_485222
Protein GI: 86748726
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02195] lipopolysaccharide heptosyltransferase II
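
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes a single stop codon; the record itself does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1804001, 1805077)
print(length)                  # 1077
print(protein_length(length))  # 358
```

Under these assumptions the computed values agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.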


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTACG ATTCACTCAA AATGAGCCTG GCCGTAGCGG CTGACCGCGC GGAAACCAGC 
CCGGTGCTGC TGATCCCTTA TATGTGGATC GGCGATTTCG TGCGCTGCCA TACGGTCGTG
CGGGTCCTGA AAGACCGCTG GCCGGACCGG CCGGTGGATG TCCTGACCAC GACGCTTTGC
GCACCTTTGG TGGATTACAT GCCCGGCGTG CGCCGGGGCG TCGTCTGGGA CCTGCCGCGC
AAGCGATTGG CGCTCGACCA GCAGCGGGCG CTGGCGGCGA AGCTCCGCGA GCAGCACTAC
GGCGCCTCGC TGGTGATGCC GCGGACGTTC AAATCCACGA TTGCGCCGTT TCTCGCCGGT
ATCCCGAACC GCACCGGCTT CATCGGCGAG GTCCGGTTCG GCCTGCTCAA CGACTGGCGG
CGCGGCGAGA AGGCGCTGCC GCGGATGATC GACCGTTGTG CCGCACTGGC GCTGCCGGCC
GGCATCGACC TGCCGATGGA TTGGCCGGAG CCGCAACTCG TTGTGCCGCC AGCCGAGATC
GCCGCCTGGC GGCGGGCCAA CGGGCTGGAG GGCCGGACTG CCGTGGCGCT GGCGCCAGGC
GCGGTCGGCC CGTCGAAGCG CTGGACCTAT TATGCCGAGG CGGCCAAAGC GCTGACCGAC
CGCGGTCTGG ACGTCTGGGT GATCGGCGGC CCCGGCGAGA GCGAGAAGGC CGCCGAAATC
GTCGCCGCGG CCGGTCCACG CGCGCGAGAC CTCACCGGCA CGGACCTGCG CAACGGCATC
ATGGCGCTGG CGGCGGCCGA TCTGGTGATC TCCAACGATT CCGGCCTGCT CCACGTCGCA
GCAGCGATCG GCAGCCGCAC CATCGGCATC TTCGGCCCGA CCAGCGCCTG GCATTATGCG
CCGCTCAACC CGATCGAGGC GGTGATCGAG ACCAGGACCG ACGTGCCCTG CCGCCCCTGC
CACAAACCGG TGTGCCGGAT GGTACATCAC AAATGCATGC GCGACATTCC GGTCGAGGAT
GTGATGGCGG CGGCGCAGCA AGCGCTGGGC AAGGCCGGCC TCGCCCCGGC GCGATAG
 
Protein sequence
MNYDSLKMSL AVAADRAETS PVLLIPYMWI GDFVRCHTVV RVLKDRWPDR PVDVLTTTLC 
APLVDYMPGV RRGVVWDLPR KRLALDQQRA LAAKLREQHY GASLVMPRTF KSTIAPFLAG
IPNRTGFIGE VRFGLLNDWR RGEKALPRMI DRCAALALPA GIDLPMDWPE PQLVVPPAEI
AAWRRANGLE GRTAVALAPG AVGPSKRWTY YAEAAKALTD RGLDVWVIGG PGESEKAAEI
VAAAGPRARD LTGTDLRNGI MALAAADLVI SNDSGLLHVA AAIGSRTIGI FGPTSAWHYA
PLNPIEAVIE TRTDVPCRPC HKPVCRMVHH KCMRDIPVED VMAAAQQALG KAGLAPAR
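
The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method used to generate the record: it builds the codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted), it ignores alternative start codons, and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGAATTACG ATTCACTCAA AATGAGCCTG"
print(translate(first_block))  # MNYDSLKMSL
```

Passing the full gene sequence to `translate` and `gc_fraction` should allow the Protein sequence and GC content fields to be checked in the same way.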