Gene RPB_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1007 
Symbol 
ID3909131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1152771 
End bp1154021 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content72% 
IMG OID637882900 
Productglycosyl transferase, group 1 
Protein accessionYP_484628 
Protein GI86748132 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.643015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCATGA CGATCACCAC AGACACCCGG ACCGGGCACG ATCGCATCGG CGAGACCGAT 
GCAGCGTCAC CACAGCCGAC GCCGGCGCGC GACGCCGGGG CCGCGCCGGC GCGGCGCACG
CGCGTGGTCG TGATCCAGAC CCAGGCGGAG AACGCCGGCG CGCAGGAGAT TTCGCGGCTG
GTCGGCGCCG GGCTCGCCGC GCGCGGCTAC GACGTCCACA ATCTGTTCTT CTTCCGGCAG
TCGCGCTCGT TCGACGAGCC GGCGCAGACG ACGTATTGCG CGGCCCGCCG GCCGGGCGAT
CCGCTGTCGT TCCTGCGGTT TCTCGGCGCG CTCTACGCGC GCATCCGGAC GCTTCGGCCC
GACGTGGTGC TGACCTTCCA GCATTACGGC AATGCGATCG GCGGGATCGC CGCGCGGCTG
GCGAGCCCGG CGCCGGTGAT CGCCAACCAG GTGTCGGCGC GATTGACGAT GCCGGCCTGG
CTGCGCGGCG TCGATCGGAT CATGGGCCAG CTCGGCGTGT TCGAGACCAT CACGGTCAAC
TCGCACGACA TGCTGCGCGA CTATTCGCGC TATCCCGACG GCTATCGCAG GCGGCTGCAG
CACGTGCCGC ACGGCTTCGA CCAGAAGCAC GCGACCATGT CGAAGGCGGA CGCGCGCCGG
CAATTCGGGC TCAGGCCGGA TGCGGTCATT CTCGGCTCCG CCGCGCGGCT GCATCCGCTG
AAGCAGCTCG ACGCCGCCAT CCGCGTGCTG GCGCAGCGGC CGGACTGGCG CCTCGCGCTG
GCGGGCCAGG GCCCCGACGA GGCGCGCCTG CGCGAACTCG CCGACGGCCT CGGCGTGTCC
GACCGCATCA CCTTCATCGG CGAGATCTCG CCCGAGCAGG TCGCGAACTT CCTGGCCTGC
CTCGACGTGT TCGTGTTTCC CTCGCTGGCC GAGACCTTCG GCCTCGCCGC GGTCGAGGCC
GCCCATGCCG GCGTGCCGGT GGTCGCCAAC GATCTGCCGG TGCTGCGCGA AGTGCTGTCG
GCGCAAGGCG AACCGGCGGC ATTGTTCGTC GATGCGGCGG ACCCCGCCGC GATGGCGAAC
GCGATCGCCC GGGCGCTCGA CGACGACGCG CTCCGCGCGC AGCTCCGCCG CGCCGGCGAC
GGGCTGAAGT CGCGCTACGC GGTCGACGCC ATGGTCGACG AGTATGTCCG CATCATCGAG
GGCGCAACGC AGCCGGCAGC GCGACGGCAA GGAGCCGGCC GTGATCGTTG A
 
Protein sequence
MTMTITTDTR TGHDRIGETD AASPQPTPAR DAGAAPARRT RVVVIQTQAE NAGAQEISRL 
VGAGLAARGY DVHNLFFFRQ SRSFDEPAQT TYCAARRPGD PLSFLRFLGA LYARIRTLRP
DVVLTFQHYG NAIGGIAARL ASPAPVIANQ VSARLTMPAW LRGVDRIMGQ LGVFETITVN
SHDMLRDYSR YPDGYRRRLQ HVPHGFDQKH ATMSKADARR QFGLRPDAVI LGSAARLHPL
KQLDAAIRVL AQRPDWRLAL AGQGPDEARL RELADGLGVS DRITFIGEIS PEQVANFLAC
LDVFVFPSLA ETFGLAAVEA AHAGVPVVAN DLPVLREVLS AQGEPAALFV DAADPAAMAN
AIARALDDDA LRAQLRRAGD GLKSRYAVDA MVDEYVRIIE GATQPAARRQ GAGRDR