Gene RPB_4101 details


Gene Information

Locus tag: RPB_4101
Symbol:
ID: 3911908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4669760
End bp: 4670932
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 67%
IMG OID: 637886005
Product: glycosyl transferase family protein
Protein accession: YP_487705
Protein GI: 86751209
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
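
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(4669760, 4670932)
print(length)                     # 1173
print(protein_length_aa(length))  # 390
```

Both results agree with the Gene Length and Protein Length fields in the record.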


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0116147
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.150084
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTGAAC AGTCATCGAC CAGGATCTCC GCCGCGACTT ACGGCGCTGT GGTCGCCATT 
CCTGCTCACA ATGAGGCCGA CACGATCCGT CGGTGCCTGG CGGCGCTGGC GATGCAGCGC
GACGAGTCCG GTTGCCCGGT GCGCGCCGGG GCATTCGAAA TTCTGATCTA CGCCAACAAT
TGCAGCGACA GCACCGTCGA GGTGGTCCGT CACTTTGCCT GCTGCATTCC GCATCCGATC
ATCGTGATCG AAGCACAATT GCCGCCGTCG CAACTCTCGG CCGGCGCGGC ACGCAAGACG
GCTATGGATC TCGCTGCCGC GAGGCTCGCC GAGCGTGGCG CAGCCGACGG GGTGATCCTC
ACGACCGATG CAGACAGCTG CGTAGCGCCG ACCTGGTTCT CGACGACGAT GCGGGAATTG
AGCGGCGGTG TGGATTGCGT TGCGGGATAC ATCGATGCCG AACCGCTCGA ACTGGTCGGC
CTCGGGCCGG CGTTTCTGGC GCGCGGTCGG CTCGAAGACG CGTATCTGAG ATTGATCGCC
GAAATCGACG CCCGTTGCGA TCCCCGCCGC CATGATCCCT GGCCGAACCA CCGTGTCGCG
TCCGGCGCGA GCCTGGCCGT GGTGTTGAAG GCCTATCTGG CCATCGGCGG GTTGCCGCTG
CGCGCGGTGG GCGAGGATGC CGCCCTCACC GCTGCGCTCG ACCGCGGGGG GTTCAAGGTG
CGACATTCCA TGGCCGTCTC GGTGACGACG TCGTGCCGGC TCGACGGTCG TGCGCAGGGC
GGCGCCGCCG ATACGATGCG GCTGCGCCAC GCGATGCCGG ACGCGCCCTG CGACGACGAT
CTCGAGCCGG CGTTGCAGGC GACCCGCCGC GCCATCTATC GCGGACGTCT GCGCCGGCTG
CTGGACGAAC AAAGGTATCG CGCCCGGCAG GTTCAGGATA TTCCGGCTCA GCAGCCACCG
CGCCCAGGCG CTACGTTCGA CGAGGCGTGG CAGCAGCTTT GTCGCGACAA TCCGGTTCTT
CGCCGCGGCG GGTCGTTGCG ACCGTCCGAT CTGCCGCGGC AGATCGCCGT TGCGACCATG
GTGCTACGGC ATCTGCGGCT GCCGCTCAGT GCGACGACAG TCGTTCCAGC CGATATGTCG
CGTCGCGAAC GATGGCTCGA GCCGGCAGCC TGA
 
Protein sequence
MFEQSSTRIS AATYGAVVAI PAHNEADTIR RCLAALAMQR DESGCPVRAG AFEILIYANN 
CSDSTVEVVR HFACCIPHPI IVIEAQLPPS QLSAGAARKT AMDLAAARLA ERGAADGVIL
TTDADSCVAP TWFSTTMREL SGGVDCVAGY IDAEPLELVG LGPAFLARGR LEDAYLRLIA
EIDARCDPRR HDPWPNHRVA SGASLAVVLK AYLAIGGLPL RAVGEDAALT AALDRGGFKV
RHSMAVSVTT SCRLDGRAQG GAADTMRLRH AMPDAPCDDD LEPALQATRR AIYRGRLRRL
LDEQRYRARQ VQDIPAQQPP RPGATFDEAW QQLCRDNPVL RRGGSLRPSD LPRQIAVATM
VLRHLRLPLS ATTVVPADMS RRERWLEPAA
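
The protein sequence follows from the gene sequence under translation table 11, where an alternative initiation codon such as GTG is read as methionine at the first position. The following is a minimal sketch of that translation and of a GC-content calculation. The helper names (`translate_cds`, `gc_percent`) are hypothetical, the codon table is built from the standard amino-acid ordering for bases T, C, A, G, and the start-codon set is the one listed for table 11 in the NCBI genetic code tables.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order
# (the standard assignment, shared by translation tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons for table 11; at position 1 they encode methionine.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate_cds(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean(cds)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


# First three display blocks of the gene sequence above.
print(translate_cds("GTGTTTGAAC AGTCATCGAC CAGGATCTCC"))  # MFEQSSTRIS
```

Passing the complete gene sequence to `translate_cds` should give the protein sequence listed above, and `gc_percent` can be used in the same way to check the reported GC content.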