Gene Rpal_4933 details


Gene Information

Locus tag: Rpal_4933
Symbol:
ID: 6412624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5313845
End bp: 5314924
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 73%
IMG OID: 642714815
Product: Extensin family protein
Protein accession: YP_001993897
Protein GI: 192293292
COG category: [S] Function unknown
COG ID: [COG3921] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
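
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with a short calculation. The sketch below assumes that the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5313845
end_bp = 5314924

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a
# stop codon, which does not contribute an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed lengths agree with the reported 1080 bp and 359 aa.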


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.895839
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGCGC TGGTTGCAGG CGGCACCTCG GCCGCGAACG CGCGTGAGCA CGTGCCGCTG 
CCGAAGCCGC GTCCGGCCGA GGCGCCGCAG GCGAATGCGC GCGAGGCCGA GCCCGGCGAG
GATGAGCCAA CGCCTGCGGA AGCTGCCGCC CCCGACAGCG CGCCCGCCGC CAAAGCCGAT
GCGGCGCAGG CTCCCAAGCC ACCGTCCGAA TGCCGGCTGG CCCTGACCGA GCAGATCGCG
ATCGCGCCGA GCATTCCGGA TATCACCGGG CCGGGCGCCT GCGGCGGATC CGATCTGGTC
CGGCTCGAAG CGGTGGTGCT GCCGGACGGC CGCCGTGTCT CGATGTCGCC GGCCGCGACG
CTGCGCTGCG GCATGGCGCG GGCGATCGCT GATTGGGTGC GCGCCGACAT CGCGCCGCTG
GCCGTCTCGC TCGGCAGCCG GGTCTCGGAT CTGGACAATT TCGACTCTTA TGAATGCCGC
GGCCGCAACC GGGTGCGCGG CGCTAAGCTC AGCGAGCACG GCCGTGCCAA TGCGCTCGAC
CTCCGCGGCA TCAAGCTCGC CGACGGGCGG ATGATTTCGC TGACTGACCG CGAGGCGCCG
CGCGCGCCCA GGGAAGCCGT GATGCAATCG GTGTGCGCGC GCTTCACGAC CGTGCTCGGT
CCAGGCTCCG ACGGCTATCA CGAGGACCAC ATCCACCTCG ATCTCGCCGA GCGCCGCGGC
GGCTACCGGA TGTGCCAATG GGCCTTATAC GAGGGGCTCC CGAATATTGC GCCGGTGATG
CCGCTGCCGC GCCCGGCCGA AGCGCCGCCG CGCGAAGTCG CGGCCGACGA CGAGCGCGCG
CCGCAGCAGG CAGCCCCGTC CCAGTCCGAG GCAGCCGAGC AGGCCCCGAC CGAGGAGGCC
GAACGCGAAC AGGCCGAGAC CCCACCGCCT CCGCCGCCGA AGCCGGCCAA GCGCGCCAAG
TCCAAGGCGG CCGCCGCGAA GCCGGCCGCG AGCAAGCCGA TCGATCTGAA GCCGCAAGCG
GCGCCGGCCG CAACGCCGGC GGCTCGCGGC AAGCCGGCGC CGACACGACC ACCGGTCTGA
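
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any further analysis, for example before recomputing the GC content field. A minimal sketch follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from the record, as an illustration.
first_line = "ATGATCGCGC TGGTTGCAGG CGGCACCTCG GCCGCGAACG CGCGTGAGCA CGTGCCGCTG"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the complete gene sequence to the same two functions gives a value that can be compared with the GC content reported in the record.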
 
Protein sequence
MIALVAGGTS AANAREHVPL PKPRPAEAPQ ANAREAEPGE DEPTPAEAAA PDSAPAAKAD 
AAQAPKPPSE CRLALTEQIA IAPSIPDITG PGACGGSDLV RLEAVVLPDG RRVSMSPAAT
LRCGMARAIA DWVRADIAPL AVSLGSRVSD LDNFDSYECR GRNRVRGAKL SEHGRANALD
LRGIKLADGR MISLTDREAP RAPREAVMQS VCARFTTVLG PGSDGYHEDH IHLDLAERRG
GYRMCQWALY EGLPNIAPVM PLPRPAEAPP REVAADDERA PQQAAPSQSE AAEQAPTEEA
EREQAETPPP PPPKPAKRAK SKAAAAKPAA SKPIDLKPQA APAATPAARG KPAPTRPPV